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Abstract. The C^-free process starts with the empty graph on n vertices and adds 
edges chosen uniformly at random, one at a time, subject to the condition that no 
copy of Ce is created. For every £ > A we show that, with high probability as n — oo, 
the maximum degree is 0((nlogn)^/(^~^'), which confirms a conjecture of Bohman and 
Keevash and improves on bounds of Osthus and Taraz. Combined with previous results 
this implies that the Q-free process typically terminates with 6(n^/(^~^^(logn)^/(^~^^) 
edges, which answers a question of Erdos, Suen and Winkler. This is the first result 
that determines the final number of edges of the more general iJ-free process for a non- 
trivial class of graphs H. We also verify a conjecture of Osthus and Taraz concerning the 
average degree, and obtain a new lower bound on the independence number. Our proof 
combines the differential equation method with a tool that might be of independent 
interest: we establish a rigorous way to 'transfer' certain decreasing properties from the 
binomial random graph to the H-iiee process. 

1 Introduction 

The random graph process was introduced by Erdos and Renyi [10] in 1959. It starts with the empty 
graph on n vertices and adds new edges one by one, where each edge is chosen uniformly at random 
among all edges not yet present. Since then it has been studied extensively, and many tools and 
methods for investigating its typical properties have been developed, see e.g. [5l|8l[12]. In this work 
we consider a natural variant of the above process which has very recently received a considerable 
amount of attention [2l[3l[IIllI71[l8l|23l[2ll[l5l[26l[27]. 

The H-free process was suggested by Bollobas and Erdos [1] in 1990, as a way to generate an 
interesting probability distribution on the set of maximal //-free graphs with potential applications 
to Ramsey Theory. Given some fixed graph H, it is a modification of the classical random graph 
process, where each new edge is chosen uniformly at random subject to the condition that no copy 
of H is formed. It was first described in print in 1995 by Erdos, Suen and Winkler [9], who asked 
how many edges the final graph typically has (this also appears as a problem in [7j). The main 
difficulty when analysing this process is that there is a complicated dependence among the edges; 
the order in which they are inserted is also relevant. 

The first results addressed certain special graphs, determining the typical final number of edges 
up to logarithmic factors. The case H = C3 was studied in 1995 by Erdos, Suen and Winkler [9], 
and in 2000 Bollobas and Riordan [6] considered H S {K^jC^}. In fact, a result of Rucihski 
and Wormald [21j predates those mentioned above: in 1992 they considered the (much simpler) 
maximum degree d-process, which corresponds to the case H = -fCi,d+i, and showed that whp0 it 
ends with [nd/2\ edges. The general H-free process was first analysed independently by Bollobas 
and Riordan |6J and Osthus and Taraz [16] in 2000. In fact, they assumed that H satisfies a 
certain density condition (strictly 2-balanced), which holds for many interesting graphs, including 
cycles and complete graphs. Osthus and Taraz determined the typical final number of edges up to 



^ As usual, we say that an event holds with high probability, or whp, if it holds with probability 1 — o(l) as n — s- 00. 
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logarithmic factors and conjectured that whp the average degree in the final graph of the C^-free 
process is ©((nlogn)^/^^"-'^)). 

The next improvements came about ten years later. In a breakthrough in 2009, Bohman [2] obtained 
the first matching bounds: he proved that the Ca-free process ends whp with 0(n^/^\/log n) edges, 
confirming a conjecture of Spencer [22]. Next, Wolfovitz [25] slightly improved the lower bound on 
the expected final number of edges for a range of graphs H. Very recently, for the class of strictly 
2-balanced graphs Bohman and Keevash [3] obtained new lower bounds that hold whp, which 
they conjectured to be tight up to the constants. In fact, their conjecture is for the maximum degree: 
for the Cf-free process they conjectured that the maximum degree is whp at most D{n\ognY/^^^^^ 
for some D > 0. 

As one can see, the typical final number of edges in the H-iree process has attracted a lot of 
attention, and for a large class of graphs H interesting bounds are known. However, not much 
progress has been made in obtaining good upper bounds. After Bohman's result for C3, the next 
case to be resolved was H = K^, for which matching bounds have been obtained by the author |24j . 
and, independently, by Wolfovitz [26]. During the preparation of this paper Picollelli [n\ I18j also 
resolved the cases H G {C4, K^}. But despite this progress, since the upper bound for the maximum 
degree d process in [21] is immediate, one can argue that non-trivial matching upper bounds have 
not been determined for any class of graphs. 

The H-iiee process is nowadays considered a model of independent interest as well. For strictly 
2-balanced H, the early evolution of various graph parameters, including the degree and the number 
of small subgraphs, has been investigated in [3l|27]. These results suggest that, perhaps surprisingly, 
during this initial phase the graph produced by the H-hee process is very similar to the uniform 
random graph with the same number of edges, although it contains no copy of H. Studying the 
typical structural properties, e.g. the degree, in the later evolution of the H-hee process is an 
intriguing problem, and so far only some preliminary results are known, cf. [11] f^S] . 

Motivation for studying the H-hee process also comes from extremal combinatorics, where its 
analysis has produced several new results. For example, improved lower bounds on the Turan 
numbers of certain bipartite graphs and Ramsey numbers R{s, t) with s > 4 have been established 
in [21 131 [25], and Bohman [2] reproved the famous lower bound for R{3,t) obtained by Kim |15j . 
One of the key ingredients for these results is an upper bound on the independence number of the 
H-hee process, cf. [2 [3]. So far only for the special cases H G {€3,04} are these estimates known 
to be best possible, and it would be interesting to obtain good lower bounds for other graphs. 

1.1 Main result 

In this paper we prove a new upper bound on the final number of edges of the C^-free process. In 
fact, we give a new upper bound for the maximum degree, which confirms a conjecture of Bohman 
and Keevash [3] and improves previous upper bounds by Osthus and Taraz |16| . 

Theorem 1.1. For every I > A there exists D > such that whp the maximum degree in the final 
graph of the Cg-free process is at most D{n log n)^^^^~^K 

Up to the constant our upper bound is best possible, since the results of Bohman and Keevash [3] 
imply that for some c > 0, whp the minimum degree is at least c(n log n)^/(^~^). The special case 
i = 4 was proved independently by Picollelli [18]; since this manuscript was submitted Picollelli [19] 
has independently also proved the case i > 4. So, combining our findings with [3], we not only verify 
the mentioned conjecture of Osthus and Taraz [1^, but establish the following stronger result. 
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Corollary 1.2. For every ^ > 4 there exist c,D > such that in the final graph of the C^-free 
process whp the number of edges is between cn^^^^~^\logny/^^^^^ and Dn^/^^~^\\ogny/^^~^\ and 
whp the degree of every vertex is between c(n log n)^/^^""*^^ and D(n\ognY^^^~^\ □ 

This is a natural extension of the main result of Bohman [2j for the Ca-free process, and answers 
a question of Erdos, Suen and Winkler for the C^-free process (see [ZllS]): whp the final graph has 
0(7^^/(^-i)(logn)^''^^~"'^^) edges. Since this question was asked for the H-iiee process in 1995, this is 
the first result that determines (up to constants) the final number of edges for a class of graphs. 

We also obtain a new lower bound on the independence number of the C^-free process. Indeed, as 
pointed out to us by Picollelli, using Corollary 2.4 of Alon, Krivelevich and Sudakov [1], Corollary ll.2l 
implies the following bound conjectured in an earlier version of this paper (together with a proof of 
a weaker bound). 

Corollary 1.3. For every i > 4 there exists c > such that whp the independence number in the 
final graph of the C^-free process is at least c(n log n)(^~^)/(^~^). □ 

Up to the constant this matches the upper bound established by Bohman and Keevash [3]. We infer 
that whp the independence number in the final graph of the C^-free process is Q(n log n)^^~'^^/^^~^^). 

1.2 Comparison with previous work 

The basic idea of the proof is similar to [16J: we show that, after a certain number of steps, every 
pair {v,U) with v ^ U and \U\ = D{nlogn)^^^^~^^ has some property that prevents U C T{v) in 
the final graph of the C^-free process. Osthus and Taraz [16] establish their 0{n^^^^~^^ logn) bound 
for the maximum degree using a 'static' point of view: they couple the C^-free process (or more 
generally the H-iiee process) with the classical random graph process and then show that even after 
deleting all edges contained in a copy of Ci, every (u, U) has the desired property. By contrast, we 
obtain the better 0((n log n)^/(^~^)) bound by tracking the step-by-step effects of each edge added 
in the C^-free process, and our main tool is the differential equation method used in |24j . 

Our argument relates to the proof of Bohman for the Cs-free process as follows. In [2] it is shown 
that every large set of vertices contains at least one edge, which implies a bound on the maximum 
degree, since the neighbourhood of each vertex is an independent set. In other words, the upper 
bound follows from a bound on the independence number. For the C^-free process, £ > 4, the 
maximum degree is a separate question. In particular, we need to consider a more involved event, 
and thus must study the combinatorial structure of large sets more precisely. 

To this end we track several random variables for every (v, U). But, when applying the differential 
equation method, there are significant technical difficulties, and a simple refinement of the approach 
used in [21] for the i^4-free process does not suffice to overcome them. Here one crucial ingredient 
is a new connection between the H-hee process and the Erdos-Renyi random graph, which might 
be of independent interest. More precisely, we develop a 'transfer theorem', which enables us to 
prove certain results for the H-bee process using the much simpler binomial random graph model. 
This is a key tool for establishing properties of the C^-free process which otherwise seem difficult 
to derive. We believe that it will also aid in proving new upper bounds for the H-iiee process. 
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1.3 Organization of the paper 



We start by collecting the relevant properties of the C^-free process in Section [2j In Section [3] we 
then introduce several probabilistic tools and the differential equation method. Section [4] is devoted 
to the proof of Theorem ll.il Our argument relies on two key statements, whose proofs are deferred 
to Sections [5] and [8l We apply the differential equation method in Section [5l and introduce the 
'transfer theorem' in Section [6l Next, in Section [7] we collect properties of the binomial random 
graph, which are then used to complete the proof in Section [8l 

2 The C^-free process: preliminaries and notation 

In this section we introduce some notation and briefly review properties of the C^-free process 
needed in our argument. We closely follow [3] and the reader familiar with the results of Bohman 
and Keevash may wish to skip this section. 

2.1 Terminology and notation 

Let G{i) denote the graph with vertex set [n] = {1, . . . ,n} after i steps of the C^-free process. Its 
edge set E{i) contains i edges; we partition the remaining non-edges (^g') ™to two sets, 0{i) 

and C{i), which we call open and closed pairs, respectively. We say that a pair uv of vertices is 
open in G{i) if G{i) U {uv} contains no copy of C^. So, the C^-free process always chooses the next 
edge ej+i uniformly at random from 0{i). In addition, for uv G 0{i) U C{i) we write Cuv{i) for the 
set of pairs xy £ 0{i) such that adding uv and xy to G{i) creates a copy of Gi containing both uv 
and xy. Note that uv £ 0{i) would become closed, i.e., belong to C{i + 1), if ej+i G Cuv{i)- 

With a given graph in mind, we denote the neighbourhood of a vertex v by r(t>), where, as usual, 
T{v) does not include v. For S C [n] we define T{S) = Ut,eS -'^('^)- Furthermore, for A, i? C [n], let 
e(j4, B) denote the number of edges that have one endpoint in A and the other in B, where an edge 
with both ends in A n i? is counted once. If the graph under consideration is G{i) we simply write 
rj(-), but usually we omit the subscript if the corresponding i is clear from the context. Given a 
set S and an integer A; > 0, we write (^) for the set of all A;-element subsets of S. 

We use the symbol it in two different ways, following [21 13]. First, we denote by a it 6 the interval 
{a + xh : —1 < X < 1}. Multiple occurrences are treated independently; for example, Yli&if^i^i ^ ^i) 
and niG[j]("i =t ^i) mean {I]jg[j](ai + Xik) : -1 < xi, . . . , xj < 1} and {HieblK + ^i^i) ■ ~^ ^ 
xi, . . . ,Xj < 1}, respectively. For brevity we also use the convention that x = a±b means x £ a±b. 
Second, when considering pairs of random variables and functions, e.g. , Y~ and y^, y~ , we 
use the superscript it to denote two different statements: one with it replaced by +, and the other 
with lb replaced by — . For example, Y^{i) = y^{t) means l'^(i) = y^{t) and Y~{i) = y~{t). 
Finally, combinations of both ways are treated independently; for example, Y^{i) = y^{t) zt b 
means Y^{i) = y^{t) it b and Y~{i) = y~^{t) it b. 

2.2 Parameters, functions and constants 

In the remainder of this paper we fix £ > 4. Following [3j, we introduce constants e, ^ and W. 
We choose W sufficiently large and afterwards e and /i small enough such that, in addition to the 
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constraints implicit in [3] for H = Ci, we have 

W>f2^+^>50, e<l/{2^H^) and 21^/-^ < e. (1) 

Since the additional constraints in [3] only depend on H = Ci, we deduce that /U is an absolute 
constant (depending only on i). Next, similar as in [3] we set 

p = n-'+yii-i)^ W = /u(logn)i/(^-i) and m = n'pt^.. = f,n'/('~'Hlogn)'/^'-'\ (2) 

Formally, m (a number of steps) should be defined as [n^ptmaxj , say, but, as usual, we will henceforth 
ignore the irrelevant rounding to integers. For every step i we define t = t{i) = i/{'n?p), where, 
for the sake of brevity, we simply write t if the corresponding i is clear from the context. Next we 
introduce the functions 

q{t) = e-(2*)'" and f{t) = e(*'-^+*)^. (3) 
Now, using ([1]), for every < t < tmax, for n large enough we readily obtain 

1 > q{t) > n-^l^ and 1 < /(t)g(t)^ < /(t) < . (4) 

2.3 Previous results for the C^-free process 

The results of Bohman and Keevash [3] imply that a wide range of random variables are dynam- 
ically concentrated throughout the first m steps of the C^-free process. For our argument the key 
properties are estimates on the number of open pairs as well as bounds for the degree and certain 
closed pairs. So, for the reader's convenience we state their results here in a simplified form. 

Theorem 2.1. ^ Set Se = n^^^'^^^~^ . Let Tj denote the event that for every < i < j, we have 
\0{i)\ > as well as 

\0{i)\ = (1 ± 3f{t)/se) q{t)n'/2 and (5) 

|rj(^')| < 3nptmax for all vertices v € [n]. (6) 

Let Jj denote the event that for every < i < j we have 

\Cuv{i)\ = ((^ - l){2tY"'^q{t) ± 7£f{t)/se^ p-^ for all uv € 0{i) U C{i) and (7) 

\Cu'v'{'i') n Cu"^"(i)| < n^'^^^p^'^ for all distinct u'v' ju"v" G 0{i). (8) 

Then Jm n Tm holds whp in the C^-free process. □ 

After some simple estimates, both ([5]) and ([6]) follow directly from Theorem 1.4 in [3j. Now, using 
aMt{Ci) = 21 and (2tY~'^q{t) < 1, which follow from elementary considerations. Corollary 6.2 and 
Lemma 8.4 in [3] imply ([7]) and ([8]). (Because the 'high probability events' of [3] in fact hold with 
probability at least 1 — n~'^^^\ we may take the union bound over all steps and pairs.) We remark 
that there is a factor of 2 difference in ([7| since we use unordered instead of ordered pairs. 

In our argument we use two additional properties of the C^-free process. The next lemma follows 
from Lemmas 4.2 and 4.3 in [23], which in turn are based on Lemmas 4.1-4.3 in [3]. 

Lemma 2.2. [24] Let fCi denote the event that for all a, 6 > 1 and every A,B CI [n] with \ A\ = a and 
\B\ = b, in G{i) we have e{A,B) < max{4e^^(a + b),pabn'^^}. Let Ci denote the event that for all 
a > 1 and d > max{16e~"^, 2apTi^^}, for every A C [n] with |^| = a we have \Dj\^^(i(i)\ < 16e^^d~^a, 
where DA^ii) C [n] contains all vertices v G [n] with |r(t;) r\ A\ > d in G(i). Then the probability 
that %n holds and K,rn H Cm fails is o(l). □ 
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3 Probabilistic tools 



In this section we introduce several probabilistic tools that we will use in our argument. 

3.1 Concentration inequalities 

The following Chernoff bounds, see e.g. Section 2.1 of |12j . provide estimates for the probability 
that a sum of independent indicator variables deviates substantially from its expected value. 

Lemma 3.1 ('Chernoff bounds'). Let X = -^i, where the Xi's are independent Bernoulli- 

distributed random variables. Set n = E.[X]. Then for all t > we have 

Furthermore, for all t >7fi we have 

¥[X >t]< e'K (10) 

In our argument we need to estimate the probability that in Gn^-p some subset contains 'too many' 
copies of a certain graph. Rodl and Rucihski [20] showed that exponential upper-tail bounds can be 
obtained if we allow for deleting a few edges; this is usually referred to as the Deletion Lemma |13j . 

Lemma 3.2 ('Deletion Lemma'). Suppose < p < 1 and that S is a family of subsets from {^^) ■ 
We say that a graph G contains a G S if all the edges of a are present in G. Let fi denote the 
expected number of elements in S that are contained in Gn,p- Let T)C{h, k,S) denote the event that 
there exists Xq C 5 with \Zq\ < b such that, setting Eq = UaeXo ^{''^iP) \ contains at most 
^ + k elements from S. Then for every b,k > the probability that T>C{b, k, S) fails is at most 

In [24J a slightly weaker variant of the above lemma was proven for the H-iiee process, where H is 
strictly 2-balanced. The results of Section [6] will shed some light on this intriguing phenomenon. 

3.2 Differential equation method 

A crucial ingredient of our analysis is the differential equation method, which was developed by 
Wormald [281 |29] to show that in certain discrete stochastic processes a collection V of random 
variables is whp approximated by the solution of a suitably defined system of differential equations. 
Developing ideas of Bohman and Keevash [3], the following variant was introduced in [23]. It 
will be an important tool for showing that certain random variables are dynamically concentrated 
throughout the evolution of the C^-free process. 

Lemma 3.3 ('Differential Equation Method' [241 Lemma 5.3]). Suppose that m = m{n) and s = 
s(n) are positive parameters. Let C = C{n) and V = V(n) be sets. For every < i < m set 
t = t{i) = i/s. Suppose we have a filtration J-q ^ J-i ^ ■ ■ ■ and random variables Xfj{i) and Y^{i) 
which satisfy the following conditions. Assume that for all a € C x V the random variables Xf^{i) 
are non-negative and J^i- measurable for all < i < m, and that for all < i < m the random 
variables Y^{i) are non-negative, Ti+i -measurable and satisfy 

X^{t + 1) - X^it) = Y+ii) - Y-{i). (11) 
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Furthermore, suppose that for all < i < m and T, £ C we have an event G J^j. Then, for all 

< i < m we define B<i{T,) = (Jq^^-^^ In addition, suppose that for each a € C x V we have 

positive parameters = Ua{n), Ao- = \a{n), = l^ain), r^, = Ta{n), s^ = Su{n) and = Sa{n), 
as well as functions Xa{t) and fa{t) that are smooth and non-negative for t > 0. For allO < i* < m 
and Ti £ C, let Qi*{T,) denote the event that for every < i < i* and a = with j £ V we have 

X„{i) = (x,{t) ± S,. (12) 



Next, for all < i* < m let £i* denote the event that for every < i < i* and S € C the event 
i3<j_i(S) U QiiTj) holds. Moreover, assume that we have an event %i £ for all < i < m with 
T~ii+i ^ T~Li for all < i < m. Finally, suppose that the following conditions hold: 

1. (Trend hypothesis) For all < i < m and a = (S, j) £ C x V, whenever 8i n -^B<_i{Ti) n "Hj 
holds we have 

^Y„H^\F,] = (yt{t)±^]^, (13) 
where y^{t) and h(^{t) are smooth non-negative functions such that 

<it) = yt{t)-y-it) and fa{t)>2fh„{T)dT + P„. (14) 

Jo 



2. (Boundedness hypothesis) For allO < i < m anda = (S, j) £ CxV, whenever £ir\^B<i{'E)r\7ii 
holds we have 



Y^^ii)<-^-—. (15) 



3. (Initial conditions) For all a £ C x V we have 



X^{0) = ( x.(0) ±i^]S^. (16) 



3Sa 

4- (Bounded number of configurations and variables) We have 

max{|C|,|V|} < min e""^. (17) 

(tGCxV 

5. (Additional technical assumptions) For all a £ C x V we have 

s > max{15uaTaisaXa//3af, 9saXa/f3a}, s/(18s^ A^//3^) <m<s- Ta/19U, (18) 



pm/s 

sup ytit) < A,, / \x'^{t)\ dt < A,, (19) 

0<t<m/s Jo 

(■ml s 

ha(fS) < s„\„ and / \h'^{t)\ dt < s^Xa- (20) 

Jo 



Then we have 



\^£rn n Um] < 4 max e""" . 

o-ecxv 



An important feature of Lemma 13.31 is that the variables in V are tracked for every configuration 
S G C. However, it only gives approximation guarantees for the variables that 'belong' to S as long 
as the 'local' bad event B<i(S) fails. For more details we refer to Section 5.3 and Appendix A.l 
in [23]. Here we just remark that if the above conditions 1-5 are satisfied for n large enough, Tim 
holds whp and = w(l) for all cr € C x V, then Lemma 13.31 implies that Sm holds whp. 
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4 Bounding the maximum degree 



In this section we prove our main result, namely that whp the maximum degree in the final graph 
of the C^-free process is 0((n log In Sections 14.11 and 14.21 we first discuss the main proof 
ideas and introduce the formal setup used. Section [4.31 is then devoted to the proof of Theorem II. H 
which in turn relies on two involved statements that are proved in subsequent sections. 



4.1 Sketch of the proof 

The following definition plays a crucial role in our proof. Given (v,U), where v G [n] and U C 
[n] \ {v}, a C^-extension for {v, U) is a path on ^ — 1 vertices whose end vertices are in U and whose 
remaining vertices are disjoint from U U {v}. Clearly, for every vertex v € [n], in the final graph of 
the C^-free process {v,T{v)) must not have a C^-extension. Set 

7 = max <^ 180 > and = 7nptmax = T/"!"- log ^)^^^^~^^ (21) 



again ignoring the irrelevant rounding to integers in the definition of u. In order to bound the 
maximum degree hy u = D{n\ogny^^^~^\ where D = 'jfi, it is enough to prove that whp every 
{v, U) G [n] X with v ^ U has at least one C^-extension after the first m steps. The same 
basic idea was used in [16], but our proof takes a different route, inspired by [21]. After i steps, we 
denote by Oi,^i/{i) the set of open pairs which would complete a C^-extension for {v, U) if chosen as 
the next edge. It seems plausible that it in order prove Theorem 11.11 it suffices to show that, after 
some initial number of steps, \Ov^u{'i)\ is always not too small. Indeed, this implies a reasonable 
probability of completing such an extension in each step, which in turn suggests that the probability 
of avoiding a C^-extension in all of the first m steps is very small. 

We now illustrate our approach for establishing a good lower bound on \Oy^i}{i)\ for the case when 
^ = 5. For ease of exposition, we ignore factors whenever these are not crucial and also assume 
that the number of steps i is large. So, in our rough calculations we will e.g. ignore whether an edge 
is open or not, since |0(i)| = a;(n^~'^) by (|4]) and ([5]). Note that in this case we have p = n~^^^, 
m ^ \C:,y{i)\ w and \U\ ^ np = nV^ by ([2]), (UD and 1^. 



4.1.1 The random variables used 



We define as the set of pairs xy £ Ov^i/{i) with x £ U and y ^ U U {v}. Observe that for 

every xy £ 0'-jj{i) there exists a path voViV2 = y with vq £ U \ {x} and vi ^ U U {v,x,y}, cf. 
Figure [H The 'last' edge completing a Cs-extension for [v, U) could be any one of the edges of the 
path, so we expect that 0'~ contains constant proportion of Oy^u{i). 

Vi 



y = V2 

Figure 1: A pair xy £ O'- Solid lines represent edges and dotted lines open pairs. 

Let Zij^u{i) contain all quadruples {vo,vi,V2,V3) £ Ux [n]^ x U with {fo^i, t^it^2} ^ V2V3 £ 0{i) 
and {^1,^2} n ([/ U {v}) = 0. Using random graphs as a guide, we expect that G{i) shares many 
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properties with the binomial random graph G„_p, since its edge density is roughly 2tp ~ = p. 

So, given y, the expected number oi vq £ U for which there exists a path voViV2 = y should be 
roughly n\U\p'^ = o(l). Hence on average xy G is contained in only one such path ending in 

U, which suggests that up to constants \Zv^u{i)\ « \0'- To sum up, our discussion indicates 

that a reasonable lower bound for \Zi,^if{i)\ suffices to prove that |0{i^[/(i)| is large. For this we intend 
to use the differential equation method and so we introduce additional variables in order to control 
the one-step changes of \Zij^u{i)\. To this end let Yy^u{i) be the set of all {vq, ^1,^2,^3) G U x [n]^ x U 
with {vi,V2}ri{U{J{v}) = that satisfy vqVi G E[i), 1^2, ^^21^3} ^ and, similarly, let X,i,^u{i) 
contain all such quadruples with {vqVi,viV2-,V2V^} C 0(i). 

4.1.2 Technical difficulties 

One of the main problems with the approach described above is the bound on the one-step changes. 
It can happen that in one step up to quadruples are removed from Z^^u{i), which turns out 
to be too large for applying the differential equation method directly. Indeed, pick u, U such that 
{vq} U Ti{w) C [/, \Ti{w)\ ^ \U\ and v ^ {w} UU U Ti{U); taking the random graph G ji p as a 
guide, for Cj+i = wvq it is easy to see that about {np)'^\U\ ^ p~^ quadruples (t^O) ^^i) ^^2, ^^3) with 
^^3 £ ^iiw) are removed from Zfj^u^i). For the C4-free process this can be resolved using ad-hoc 
arguments (e.g. exploiting that every v ^ v satisfies |rj(w) n J7| < 1 if no C4-extension for {v,U) 
exists), but for larger cycles the situation is more delicate. To overcome this issue, we consider a 
different random variable Ty^jj{i), which is an approximation of Z^^i}{i) and is defined in such a way 
that the one-step changes are automatically not too large. Roughly speaking, this can be achieved 
by 'ignoring' the steps where the one-step changes would be too large; similar ideas have been used 
e.g. in [21 [3l UU [2l]. Clearly, this introduces a new difficulty: we need to ensure that we do not 
ignore 'too much', so that on the one hand the expected one-step changes are still 'correct', and on 
the other hand \Zi,^u{i)\ \Ty^u{i)\ holds. Consequently, we refine the tracked variables and use 
more sophisticated rules for ignoring tuples. 

There is another significant obstacle when applying the differential equation method: adding Cj+i = 
V1V2 to {vq,vi,V2,v^) G Yy^uii^) does not always result in an element of + since ej+i = V1V2 

closes f2f3 whenever ^2^3 G Cy-^v^{i) holds. This is an important difference to the Q-free process 
with ^ < 4, where this does not cause any problems when bounding the maximum degree. For 
example, whenever this happens for £ = 4, it is not difficult to deduce that at least one C4- 
extension for {v, U) already exists. Returning to the case i = 5, using our random graph intuition 
we expect that |yu^(7(i)| |?7pn^p ^ rJl'^. Similar calculations suggest that the expected number 
of quadruples in with ^2^3 G C.u^.u^{i) should be negligible compared to |l{;^;7(i)|. However, if 

we pick V such that Vi{w) C \J and |rj(ty)| w for v ^ {w\ U [/urj([7), it certainly can happen 
that there are ■ np ■ n ^ \Y{,^ij{i)\ quadruples in 15,(7(2) with V2V3 G C^^t,2(i). In other words, it 
is simply not true that for all {v, U) the effect of these 'bad' quadruples is negligible. This is a new 
difficulty in comparison to the variables tracked in the analysis of the H-fiee process [2>\. To deal 
with this issue, we substantially refine the tracked random variables, developing ideas used in [24) . 
Intuitively, we show that for every (v, U) there exists a slightly altered set of random variables where 
the above extreme example (and other difficulties) can be avoided. Here the new 'transfer theorem' 
(Theorem 16. 2p is an important ingredient, which allows us to use the much more tractable binomial 
random graph model for certain calculations (see Section [7]) . 
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Figure 2: The neig hbourhoods N^^\S) = N^j\S,X) for j £ [3], where S may also intersect with 
X and the vertex classes, i.e., with X U Vi U V2 U V3. Furthermore, S D N^^\S) 7^ is also possible. 

4.2 Formal setup 

We now introduce the formal setup used in our argument. In the following it is useful to keep in 
mind that we intend to apply the differential equation method (Lemma 13. 3p . 

4.2.1 Preliminaries: neighbourhoods and partitions 

Recall that by dHJ = 7^(nlogn)-'^/'^^ ^\ We set 

A; = n/60 = 7/60-nptmax = 7/u/60-(nlogn)^/(^"^) and r=[n/{£-3)\. (22) 

Given X CI [n], we partition {1, . . . ,{i — 3)r} \ X as follows: for every 1 < j < ^ — 3 we set 

Vj = Vj{X) = {ve[n]\X : {j - l)r < v < jr}. (23) 

With a given graph in mind, which will later be G{i) or the binomial random graph, for every 
S C [n] we define its neighbourhoods wrt. X as 

N'-^\S,X) = S and M^+'^\S, X) = T{M^\S, X)) nVj+i{X), 

see also Figure [2l Observe that all N^^\S, X) are disjoint if C X. Furthermore, X QY implies 

Vj{Y)<ZVj{X) and N^^\S,Y) <^ N^^'^ {S, X). (24) 

Finally, for the sake of brevity we define N^^^\S,X) = \Jo<j'<j N^^'\S,X). 

4.2.2 Configurations 

We define the set C of configurations to be the set of all S = {v, U,A,B, R) with v £ [n],U £ ('"'')/''^) , 
disjoint A,B G (^), and R C [n] with {v} U U R and \R\ < kn^'^^". Given S G C, we then set 
Ts = A X Vi X • • • X X B, where each Vj = Vj{R) is given by (|23]) . 

Given S G C, distinct x,y G [n] and j £ [^—1], let Cx,y,T.{'i-, j) contain all pairs bw € BxN^^-^\A,R) 
for which there exist disjoint paths b = wi ■ ■ ■ Wj = x and y = wj^i ■ ■ ■ wi = w in G{i). Note that 
adding xy and bw completes a copy of containing both xy and bw. Furthermore, observe that 
Cx,y,T,{hj) and Cy^x,T.{hj) may differ. So, for all xy € 0{i) U C{i) we see that the intersection of 
Cxyii) with B X N^^~^\A, R) is contained in |Jjg[^_;^] [Cx,y,j]{i, j) U Cy^x,T,ih j)] ■ Finally, note that 
by monotonicity we have Cx,y,T,{i,j) ^ Cx,y,T,{i' + l^i)- 
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Figure 3: Tuples {vo,vi,V2,V3) in T^fl{i), Ts,i(i) and Ts,2(«) for £ = 5, where S = {v,U, A, B, R). 
Solid lines represent edges, dotted lines open pairs and dashed lines pairs that are open or closed. 
For the other pairs there is no restriction, i.e., they may be open, closed or an edge. 

4.2.3 Random variables 

For every S G C we track the sizes of several sets throughout the evolution of the C^-free process. For 
brevity, given {vq, . . . , V£^2) G Ts, we set fj = vj-iVj for all 1 < j < £ — 2. For every < j < ^ — 3 
we introduce sets T^jii), which for < j < £ — 3 will satisfy 

Ts,,« C {(t;o,...,t>,_2) GTs : {fi,...,fj}CE{i) A {/j+i, . . . , /,_2} C 0(i)}, (25) 

and for the special case j = ^ — 3 we will have 

T^,e-3{i)Q{{vo,...,ve-2)^T^ ■ {fi, . . . , fes} Q E{i) A A_2 e 0(i) U C«}, (26) 

see also Figure [3l Note that /^_2 can be in 0{i) or C{i) for Ts/_3(i), but we will see later 
that the number of tuples with pairs in C{i) is negligible. In the following we define the Tj]j{i) 
inductively, starting with Ty;j{0) = for j > and rs,o(0) = Te. Now suppose the process chooses 
ej+i = xy G 0{i) as the next edge in step i + For j > a tuple (uq, . . . , ^£-2) S T^j-iii) is added 
to Tsjii + 1), i.e., is in T^j{i + 1), if fj = ej+i, {fj+i, . . . , fe-2} n Cf^{i) = 0, and in G{i) there is 
no path wq - ■ ■ Wj = Vj with wq G A. Furthermore, for j < £ — 3 a tuple {vq, . . . , ve-2) £ T^jii) is 
removed, i.e., not in T^j^i + 1), if ej+i G {/j+i, . . . , fe-.2} or a+i G C/^+i(i) U • • • U Cf^_^{i). For 
the special case j = £ — 3, a tuple (t>o, . . . , vi-2) G T^/sii) is removed, i.e., not in T^/sii + 1), or 
ignored, i.e., remains in Tj^/sii + 1), according to the following rules: 

Case 1. If /^_2 = ej+i, then the tuple {vq, . . . , V£_2) is removed. 

Case 2. If e^+i G Cf^_^{i), then the tuple {vq,. . . ,^^-2) is 

(R2) removed if there exists j G — 1] and x,y £ [n] such that e^+i = xy, /£_2 € 

Cx,y,T,{i,3) and |C^,j/,s(«,i)| < p"^n~^°^^, and 
(12) ignored otherwise. 

The above definition clearly satisfies ()25p and (j26p . Intuitively, the rules for removing tuples from 
Ts/-3(«) ensure that the one-step changes are 'by definition' not too large. Furthermore, the way 
in which the tuples are added yields the following extension property IAt- 

Lemma 4.1. Given i > 0, let UT{i) denote the property that for all Ti £ C and I < j < £ — 3, for 
every {vj, . . . , ^^-2) £ Vj x • • • V^^^ x B there exists at most one {vq, . . . , Vj-i) £AxViX---x Vj-i 
such that {vq, . . . ,V£^2) G Ui'<i Then Ut = l^rii) holds for every i>0. □ 
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The proof proceeds by induction on i and j; we leave the straightforward details to the reader (it 
is helpful to observe that after (vq, . . . ,^£-2) S Tsj_i(i) is added to ?Sj(i + 1), no further tuples 
containing Vj can be added due to the vo---Vj path). Note that by Ut every {vj , . . . ,ve-2) G 
Vj X ■ ■ ■ V^_3 X i? is contained in at most one tuple in |Ji'<i ^Ej(i')- This is an important ingredient 
of our argument, and we remark that a simpler variant of this property has previously been used 
in [21]. 

Recall that our goal is to show that there are many open pairs whose addition would complete a 
C^-extension for [v, U). Given E = (i), [/, A, B, R), note that for every (vq, . . . , £ TY:/-s{i), if 
fi-2 G 0{i), then adding fe^2 to G{i) would complete such a C^-extension. Now, since Ut implies 
that every pair fi_2 = with x G Vt-z and y € i? is contained in at most one such tuple in 
Ts/-3(«), our aim is to obtain a lower bound on the size of 

^s/-3W = {(^^o,...,t^£-2) G7e,£-3« : ^-2G0(i)}. (27) 

4.2.4 Bad events 

The following bad event i3j(S) is crucial for our argument: it addresses the two main technical 
difficulties outlined in Section r4.1.2[ For all < i < m and S G C we define BiiTj) = Si^j(S)Ui32,i(S), 
where 

Bi^iijl) = in G{i) there are more than k'^{npY~^n~^'^ pairs (6, w) £ B x N^^^^\A, R) for which 
there exists a path b = wq - ■ ■ w^_2 = w, and 

i32,i(S) = in G{i) we have |Ls(«)| > p^^n^^^'''^^\ where L-s{i) contains all xy G ('2^) with 
maXjg[£_i]{|Cx.,y,s(^, j)|, \Cy,x,i:{i,j)\} > p-^n'^^^^. 

Clearly, Bi{T,) depends only on the first i steps and is increasing, i.e., i3j(S) C i3j-|-i(S) holds. 

We now briefly give some intuition for 0i^j(S) and i32,i(S), which are important ingredients for 
estimating the number of tuples added to T^/-3{i + 1) and removed from Ts/_3(i). First, recall 
that {vq, . . . ,Vi-2) S TY:/-4{i) can not be added to Tj^/sii + 1) if fe-2 S Cfg_.^{i). For such 
'useless' tuples there exists a path ve-2 = wq - ■ ■ we-2 = ve-4 with {ve-2, ve^4,) G -B x N^^^^\A, R) 
in G{i), and whenever ^Bi^i^S) holds there can not be 'too many' such pairs. As we shall see, from 
this we can deduce (using the extension property Ut) that the number of 'useless' tuples is small 
compared to \T^/-Aii)\- Second, recall that not all tuples (vq, . . . , ^^-2) £ Te,£-3(*) are removed 
if ej+i G Cf^_^{i): some are are ignored. Here the key point is that Cj+i G Cf^_^{i) \ L^ii) is a 
sufficient condition for being removed, and, with ([7]) in mind, that -i;B2,j(S) essentially implies that 
|Ls(«)| is small compared to \Cf^_^{i)\. Intuitively, this will allow us to show that the ignored tuples 
have negligible impact, i.e., that |Zs,£-3(i)| ~ |7e,£-3(0I- 

4.3 Proof of Theorem [TT] 

In this section we prove Theorem 11.11 assuming the following two statements. Intuitively, the 
first lemma ensures that for 'good' configurations S the variables |Ti;j(i)| are dynamically con- 
centrated, and the second lemma essentially guarantees that for every (v, U) there exists a good 
S* = {v,U,A,B,R) for which \TY;*/-3{i)\ ~ |^E*/-3(*)|- Now we give some intuition for the tra- 
jectories our variables follow. Using ([5]), we see that the proportion of pairs which are open or an 
edge in G{i) roughly equals q{t) or 2tp, respectively, where t = i/{n?p). So, using random graphs as 
a guide, it seems plausible to expect |Tsj(i)| ~ Cj{2tpy q{tY~'^~-^ k?r^~^ ^ where the factor Cj = 1/j! 
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takes into account that we only count tuples created in a certain order. In the following results the 
functions q{t), fit) and parameters k, m, p, r, u are defined by ([2]), ([3]), ([2T|) and ([22]) . 

Lemma 4.2. For all < i* < m and T, £ C, let Qi* (S) denote the event that for every < i < i* 
and all < j < £ — 3 we have 

|Tsj(i)| = {i2t)^qit)'-^-^/j\ ± fit)qitf-^-^/n'') e/-^ , (28) 

and let £j denote the event that for all < i < j and T, £ C the event Bi-iiT,) U ^i(S) holds. Then 
£m holds whp in the C^-free process. 

Lemma 4.3. Let TZj denote the event that for all < i < j, for every ({;, U) G [n] x (f^') with 

V there exists S* = iv,U,A,B,R) G C such that -i;Bj_i(S*) holds and 

\T^*/-3ii) \ ^E'/-3(i)| < k\rp)'-\-''. (29) 
Then TZm holds whp in the Cg-free process. 

The proofs of these lemmas are rather involved and therefore deferred to Sections [5] and |8l With 
these results in hand, we are now ready to establish our main result. 

Proof of Theorem \l.l\ For the sake of concreteness, we prove the theorem with D = jfi. Given 

V G [n], U C [n]\{i;} and i < m, let Xfj^fj^i denote the event that up to step i, there is no C^-extension 
for iv, U) in the C^-free process. By we denote the event that there exists iv,U) G [n] x (f"') 
with V ^ U for which X^^u^^ holds. Furthermore, for every i < m we set Ai = R T^j R 7i, where 71 
is defined as in Theorem 12.11 and £i, IZi as in Lemmas 14.21 and 14.31 If Xm fails, then, as discussed in 
Section [4.1^ the C^-free process has maximum degree at most u = Din\ogn)^/^^~^\ So, since Am 
holds whp by Theorem 12.11 and Lemmas 14.21 and 14. 3| to complete the proof it suffices to show 

nXmr\Ara]=oil). (30) 

Suppose that for m/2 < i < m the event Ai = £i CiTZi CiTi holds. Observe that £{ R -iSi_i(Il*) 
implies ^i(S), which is defined as in Lemma 14.21 Using ([2]) we see that m/2 < i < m implies 
t = i/in^p) = a;(l), so for j = £ — 3 the main term in the brackets of (j28|) is i2tY~^ qit) / it — 3)! 
since /(t)/[n2^g(t)] = o(l) by g]). Thus, whenever R7^j holds, using ([28]), ([29]) and g(t) > n"*^/^, 
it follows that for every (i), U) with U G ('"^^^^"'^) there exists S* = iv,U,A,B,R) G C satisfying 

\Ts*/-3ii)\ > k\2tprY-\it)/ii - 1)! and \T^*,,-^ii) \ Zs*,^-3(i)| < k\2tprY-^qit)n-'' . 

Note that % gives g(t) > |0(i)|/n^ by dH and (l5|). So, combining our findings with .^e*,^-3(^) ^ 
2\;*/-3(«), using k = ti/60, r > n/i, (i2T]l and t = i/in^p) we see that for such S* we crudely have 

I^E*/-3(0I = \Ti:',i-3ii)\ - |Ts.,^-3(i) \^s*/-3(i)l > k\2tprY~\it)/l\ 

> 5u\tpn)'-'Mt) = 5^^qit) > 6-^\0ii)\. 

Recall that Oy^uii) ^ 0(z) denotes the set of open pairs which would complete a C^-extension for 
iv, U) if chosen as the next edge e^+i. Let Oe*(z) be the set of all xy G 0(i) for which there exists 
ivQ, . . . , ve-2) S ^E*,£-3(0 with /£_2 = xy. As already discussed in Section [4.2.3l by construction we 
have OE*(i) C Oy^uii), and Ut implies |Os*(i)| = |-^s*,£-3(0l- Together with ([3T]) this establishes 

»,2„-£— 3 

|Os,f7(i)| >5^^|0(z)|. (32) 
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Using this estimate, we now prove (j30p . To this end fix {v, U) G [n] x with v ^ U. We see that 

,U,m n Am/2] n 

,u,i+i n A+i I Xv,u,i n A] 

m/2<i<m— 1 

^ (33) 

< 11 p[ei+i ^ Oi),t/(i) I ;f5,c/,i n Ai]. 

m/2<i<m—l 

Note that CiAi depends only on the first i steps of the process, so given this, the process fails 

to choose ej+i from Oi^u{i) with probability 1 — \Oi^i[{i)\/\0{i)\. Now from ([32]) and (f33]) as well 
as the inequality 1 — x < we deduce, with room to spare, 

m/2<i<m-l J ^ 

Substituting the definitions of m, u, p and tmax into ([Mj) we obtain 

where the last inequality follows from (|2ip . i.e., the definition of 7. Finally, taking the union bound 
over all choices of {v, U) implies (j30p . which, as explained, completes the proof. □ 



^Xi,u,m n Am] < exp <j -^n^ V ^4ax^ = ^ "T^^^T^^^^ log n < n 



5 Trajectory verification 

This section is devoted to the proof of Lemma [4.21 Henceforth we work with the 'natural' filtration 
given by the C^-free process, where Fi corresponds to the first i steps, and tacitly assume that n is 
sufficiently large whenever necessary. For every < i < m we set Tii = %, where Ji, % are 
defined as in Theorem 12.11 Clearly, T-im holds whp. Furthermore Hi+i C and Hi G J-i, since 
T-Li is monotone decreasing and depends only on the first i steps. We set s = n?p and apply the 
differential equation method (Lemma l3.3p with V = {0, . . . ,i — 3}. Recalling that BiiY^) is monotone 
increasing, we see that i3j(E) = B<i{T,). For all cj € C x V we define 

Ua = kn^^^^ = a;(l), \^ = = rf ^ = 1, and = So = n^^ ■ (35) 

Formally, for all a = {T.,j) G C x V we set X^{i) = |Tsj(i)| and Yj'ii) = \T^j{i)\, where T^j{i) = 
+ 1) \ T-£.j{i) and - {i) = Tj]j{i) \ Tj]j{i + 1). But, for the sake of clarity, we will henceforth 
just use |TEj(i)| and |r^^(i)|. Now, for every cr = (S,j) G C x V we set ^^(t) = Xj{t), y^{t) = x^{t), 
Sa = Sj, fait) = fj{t) and Kit) = hj{t), where 

xj{t) = ■ {2tyq{tY-^~^, S, = k\'-^p^, (36) 

x+(t) = 2j/j\ . {2ty~'q{tY-'-^, f,{t) = f{t)q{tY~'-\ (37) 

xjit) = 2(1-2- - l)i2tY-^x,{t), hj{t) = /j(t)/2. (38) 

The definition of x^{t) might seem overly complicated, but it conveniently ensures x[J'(t) = and 
x~j'{t) = 2xj-i{t) / q{t) for j > 0. With the above parametrization we can restate ()28p as 

|Ts,,(i)| = {x,{t) ± f,{t)/so) A;2/"V. (39) 

The remainder of this section is organized as follows. First, in Section [5.11 we verify the trend hy- 
pothesis of Lemma 13. 31 and, next, the boundedness hypothesis in Section r5.2i Finally, in Section [531 
we check the remaining conditions of the differential equation method. 
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5.1 Trend hypothesis 

In order to establish p^ . whenever £i n -^BiiTj) n Hi holds, for every j G V we have to prove 

E[|T|^«ll^d=(-?W±^)^^$^. (40) 

5.1.1 Basic estimates 

The following inequalities were given in [23], and can easily be verified using elementary calculus. 
Recall that a ± 6 denotes the interval {a + x5:— 1<3;<1}, see Section [2?T] 

Lemma 5.1. |24^ Lemma 7.1] Suppose < x < 1/2. Then 

(l±x)"^ C l±2x. (41) 
Lemma 5.2. [211 Lemma 7.2] Suppose x,y, fx, fy, g, h > and g <1. Then fx + xg < h/2 implies 

{l±g)ix±fx)^x±h. (42) 
Furthermore, xfy + yfx + fxfy + xyg < h/2 implies 

il± g)ix±fx)iy±fy)^xy±h. (43) 

5.1.2 Triples added in one step. 

In this section we verify ()40p for T^j{i). 

The case j = 0. Clearly, adding an edge to G{i) can not create new open tuples in T^fi{i). Thus 
we always have |rjtQ(i)| = = XQ{t), which settles this case. 

The case j > 0. Recall that ej+i E 0{i) is added to G{i). Let P^j-iii) contain all {vq, . . . , vi-2) G 
TT.,j-iii) for which there exists a path wq . . .Wj = Vj with t^o G ^ in G{i). Similarly, D^j^i{i) C 
contains all tuples with {fj+i, ■ ■ ■ , fe-2} H Cf .{i) 7^ 0, where /,/ = Vj'-iVj/. With these 
definitions in hand, note that {vq, . . . ,^^-2) G 7i;j-i(^) is added to Tsj(i + 1), i.e., is in Tsj(i + 1), if 
and only if fj = e^+i and (wq, . . . ,^^-2) ^ -Ps,j~i(^) U -Dsj-i(^)) see Section |4. 2. 3[ Since the Q-free 
process chooses Cj+i uniformly at random from 0{i), whenever £i ^Bi{T,) OT-Li holds we have 

nn,ii)\\T.]= T^r (44) 

(t>0,...,l'f-2)eTs,,_i(i)\[PEa-l(i)UDEj-l(i)] ' ^ ^' 

We now bound the size of Psj_i(i). Since Tii implies Q, the degree of every vertex is bounded 
by, say, npn^. So, using |^| = A; < npn^ , j < i — 3, {npY~'^ = n^^^/^^^^^ and r > n/i, in G{i) the 
number of wj for which there exists a path wq . . . wj with wq £ A is at most 

\A\ • {npny < {npn^y-^ < ^1+^^-1/(^-1) < rn'^/m. (45) 

Given Wj, we now bound the number oi (vq, . . . , V£^2) G ^E,j-i(i) with = Vj. Observe that there 
are at most k{nprfy~^ choices for such vi, . . . , Vj-i, and at most r^~^~^k choices for Uj+i, . . . , f£-2- 
Putting things together, we deduce that 

|Ps,i-i(i)| < rn-i/(2^) . k{npny~^ • r^-^~^k < k^/-^p^-^n-^/'-^^\ (46) 
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Vj-i Vh ^/i.^— 





Vh Vj=Vh-l Vj Vh-l 



Figure 4: The sohd hues represent paths such that adding both fj = vj-ivj and fh = v^^iVh 
completes a copy of Q consisting of those paths. In other words, adding fj closes fh, i.e., fh G Cf. (i). 

Turning to D^j^i{i), we first consider the case where < j < i — 3. Suppose that fh G Cf.{i). 
Depending on whether h = j + 1 or h > j + there exists either a path Vj-i = wi - ■ ■ wi^i = Vh with 
j < /i < £ — 2, or a path wi ■ ■ ■ Wk = Vh-i with wi G {vj,Vj-i}, 1 < k < £ — 2 and j<h— 1<£ — 2, 
cf. FigureHl So, in both cases, there exists a path wi ■ ■ ■ Wk = Vx with wi S {vj,Vj-i}, 1 < k < £—1 
and j < X < i — 2. With this observations in hand, we are now ready to estimate the number of 
tuples {vq, . . . ,vi-2) G D-sj-i{i). Recall that by T-li the degree of every vertex is at most npn^. It 
follows that there are at most k{nprf y~'^r choices for vq,. . . ,Vj, and at most £'^ choices for h and 
X. Given vq,. . . ,Vj as well as h and x, there are at most 2l{nprfY~'^ < rn~^^^^^^ choices for Vx by 
(I45p . Since we already picked Vx with j < x < i — 2, for the remaining vertices among t'j+i, . . . , V£_2 
we have at most r^~^~^k choices. Putting things together, we see that for < j < ^ — 3 we have 

|^sj-i(i)l < kinpny-^ -r-^ ■ rn'^l^^^^ ■ r^-^-^k < k^^^'^p^-^n'^/'^^^l (47) 

Now we bound for the remaining case j = £ — 3. Recall that /^_3 = V£_4Vi^3. If 

fi^2 = ve-sVi-2 S Cf^_,^{i), then, with a similar reasoning as in the previous case, there exists a 
path V£-4 = Wo - ■ ■ W£-2 = V£-2, where V£-4 E N^^~'^\A, R) and U£_2 G B. Since ^Bi{Y,) holds, by 
there are at most fc^(np)^~^n~^^ such pairs (v£_2,W£-4) € B x N^^~'^\A,R) in G{i). Recall 
that by the extension property Ut (cf. Lemma l4.ip every triple (?;^_4, u^_3, U£_2) is contained in at 
most one tuple in Tj^/-4{i). So, since there are at most k"^ {npY~'^n~^'^ choices for V£_4,w^-2) ^-nd 
at most r choices for ves £ Ves, using Ut we deduce that for j = ^ — 3 we have 

|^Ej-i(i)| < k\npY-\~''' ■ r < k\'-^/-\-^' = k\'-'pi~'n~^'. (48) 

After these preparations, we now estimate (I44|) whenever £i n -i;Bj(S) n Hi holds. Observe that 
£iri^Bi{T,) implies ^i(S), and so |Tsj_i(i)| satisfies ([39]) . Furthermore, since Tii holds, this implies 
that |0(i)| satisfies ([5]). In addition, note that Se = n^/^"^^^'^ and ([4]) imply f{t)/se = o(l) and 
fj-i{t) > 1. Substituting the former estimates and (j46p - (j48p into (j44p . using n^/^^^'^ > n^^ = oj^Sq), 
(liTjl . xj{t) = 2xj^i{t)/q{t) and fj{t) = fj^i{t)/q{t), we deduce that 



mT+Ai)\ I -F,; 



(xj_i(t) ± fj.i{t)/so)k'^r^"^p^'^ ± 2A;2/-V 



c 



(l±3/(t)/se)9(t)nV2 
c,_i(t)±2/,_i(t)/so)A:2r^-V-' 



(l±3/(t)/se)^7(i)n72 
C (1 ± 6/(t)/se) • (xfit) ± Afj{t)/so) ■ k\'-^pi/{r?p). 

Therefore the desired bound, i.e., ()40p for T^-{i), follows if 

(1 ± Qf{t)/Se) ■ {x+{t) ± 4fj{t)/So) C X+{t) ± hj{t)/So. (49) 

Now, using f{t) = o{se) and Lemma |5.2|, by writing down the assumptions of (j42p and multiplying 
both sides with 2so, observe that ()49p follows from 

8fj{t) + 12x+ {t)fit)So/Se < hjit). 
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Using dH) and (f37|) we see that the second term on the left hand side is o(l). So, it suffices if 

8f,{t) + l<hj{t), 

which is easily seen to be true, since hj{t) > W/A ■ {fj{t) + 1) and > 50 by (P), (HD and (f88|) . 
5.1.3 Triples removed in one step 

Next, we prove ()40p for T^-{i). Since the rules for removing tuples from TY,j{i) are different for 
j < £ — 3 and j = i — 3, we use a case distinction. 

The case j < i — 3. Recall that a tuple {vq, . . . , i'^_2) £ Tj]j{i) is removed, i.e., not in TY,^j{i + 1), 
if ej_|_i G {fj+ii • • • ) /^-2} or Cj+i G Cj.^^(i) U • • • U Cf^_^{i). Since the edge Cj+i is chosen uniformly 
at random from 0(i), whenever £i n n Hi holds, using . . . , fi-2}\ < ^ we have 

\Cf.^Ai)\J---UCf, Ai)\ ±i 
n\T^M\^^^= E ^^'^ • (50) 

K,-,f«-2)eTs,,-(i) ' ^ 

Note that Hi implies that the inequalities ([7|) and ([8|) hold. In particular, using n^/^ = u}{se), 
n~^Ap~^ = 6t;(l) and f{t) > 1, this yields 

\Cj^^,{i) U • • • U ±^ C - j - 2)[{l - l){2tY-\{t) ± 7^/(t)/seb-i ±^2^-1 V ±^ 

C - i - 2)[(^ - l)(2t)^-2g(t) ± Uf{t)/Se\p-\ 

(51) 

Since <f^jn-ii3j(S) implies ^i(S), it follows that |Tsj(i)| satisfies ([39l) . In addition, as in Section [5.1.2l 
f{t)/se = o(l) holds and |0(i)| satisfies ([5]) by Hi. Substituting the former estimates into ([50|) . and 
using (HH) as well as x7(t)/j;j(t) = 2{t - j - 2){£ - l)(2t)^-^ we obtain 



n\TAAi)\ I 



{x,{t) ± f,{t)/so)kV'-^pi -{e-j- 2)[{i - l){2tYS{t) ± Uf{t)/se]p-^ 



' {l±3f{t)/se)q{t)n^/2 
C (1 ± 6/(t)/5e) • (x,(t) ± /,(t)/5o) • [x]{t)/x,{t) ± 2Qef{t)/{q{t)se)\ ■ kV-^pi/in'p). 

Therefore the desired bound, i.e., ()iO]) for T^-{i), follows if 

(1 ± 6/(t)/se) • (xj(t) ± /i(i)/so) • [xT(t)/x,-(t) ± 2Qef{t)/{q{t)se)] C xT(t) ± /i,(t)/so. (52) 

We now show (|52p using Lemma [5.21 Similar as for the added tuples, by writing down the assump- 
tions of ()43p . multiplying with 2so and then noticing that all terms containing contribute o(l), 
we see that it suffices if 

{1-2- - 1)2^-' f,{t) + 1 < h,{t), 
which is easily seen to be true, since hj{t) > W/2 ■ {t^^^fj{t) + 1) and W/2 > 1^2^- by ([I]) and ([38]) • 

The case j = £—3. Recall that a tuple (fg, • • • , ^^-2) ^ ^s,^-3(0 is removed, i.e., not in TE^£_3(i+l), 
if Cj+i = ft-2-, or in addition to e^+i E Cf^_^{i) it is not ignored. A moment's thought reveals that 
for every {vq, . . . ,v^_2) S rs/_3(i) with e^+i G Cf^_^{i), if ei+i ^ LE(i) then (R2) holds, where 
Lx;(i) is as in the definition of B2,iiJ^)- In other words, for every (wq, . . . ,^^^-2) G ^s/~3(^) we see 
that ej+i G C f^_^{i)\LY,{i) is a sufficient condition for being removed. Clearly, a necessary condition 
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for being removed is ej+i G {fe-2} U Cf^_^{i). Combining our previous findings and using that ej+i 
is chosen uniformly at random from 0{i), whenever D ^Bi{T,) H Tii holds we deduce that 

Recall that Hi implies the inequalities ([7|) and ([8|). Furthermore, since -i;B2,«(S) holds, we have 
|-^^s(OI < p~^n~^/^'^^\ So, similar as in the previous case, using n^/^'^^^ = uj(se), n~^^^'^^^p~^ = oj{l) 
and f{t) > 1, we obtain 

\Cf^_M ± \Lj,{i)\ ± 1 C [(^ - im'-^qit) ± 7£fit)/se]p-'±p-'n-'/^^'^ ± 1 

<Z[{£-l){2tY'^q{t)±9if{t)/s,]p-\ 



where the final estimate equals that of (jSip for j = £ — 3. It is not difficult to see that the remaining 
calculations of the case j < £ — 3 carry over word by word, which yields ()40p for To 
summarize, we have verified the trend hypothesis (j40p . 



5.2 Boundedness hypothesis 

Observe that in order to verify the boundedness hypothesis (fT5|) . using (j35|) it suffices to show that 
whenever £i n n Tii holds, for every j e V we have 

\Tij{i)\ < k/-^p>n-^^^^'. (53) 



5.2.1 Triples added in one step. 

In this section we verify (I53p for Recall that e^+i G 0{i) is added to G{i). By construction 

we always have [T^ o(^)| = 0, and thus we henceforth consider the case j > 0. Note that a necessary 
condition for {vq, . . . ,^£-2) G TEj-i(i) being added to TY;j{i + 1) is fj = Cj+i. Observe that there 
are at most kr^~^~^ choices for (vj+i, . . . ,^£-2) £ ^j+i x • • • x V^-s x B. So, using the extension 
property Ut (cf. Lemma [4.ip . we deduce that for each ej+i there are at most kr^~^~^ tuples in 
TY.^j-i{i) with fj = Cj+i. Together with ([2]), (|4]), (122]) and j >l this implies 

|r+^.(z)| < k/-^-^ = k/'^p) • {rp)~^ = o{k/-^p>n-^°^'), (54) 

as desired. 



5.2.2 Triples removed in one step 

Next we use case distinction to establish (f53l) for ■{%). 

The case j < £ — 3. We claim that whenever £i n -ii3i(S) n Tii holds, for all {vq, . . . , S 7sj(z) 
and every xy G {/j+i, • • • , the number of tuples in Tj]j{i) containing xy is bounded by 

yfcr^-Vn^^ (55) 

First suppose that xy = fj+i- For ■ ■ ■ ,ve-2) £ ^+2 x ••• x V£_3 x B there are at most 

j^j.i-'i-j < kr^-'^p) choices, and so (l55]l follows using the extension property Ut (cf. Lemma l4?T]) . 
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Next we consider the case xy = fi-2- As usual, whenever T-Li holds, the degree of every vertex is 
bounded by, say, npn^. Since for every [vq, . . . ,^£-2) S T^,j{i) the vertices fo, • • • ,Vj form a path 
starting in A, we deduce that there are at most k{npn'^Y choices for such uq, • • • ^Vj. Furthermore, 
there are most r^~^~j choices for (fj+i, . . . ,^£-4) € ^j+i x • • • x Therefore the number of 

tuples in Tj]j{i) with xy = fi-2 is bounded by kinprfy ■ r^^^^-' < kr^~'^p^ n^'' , as claimed by (155p . 

Finally we consider the case where xy = fh with j + l<h<l — 2. With a similar reasoning as 
in the previous case, there are at most k(nprfy choices for fo, • • • jfj, at most r^~^~'^ choices for 
f j+i, . . . , Vh-2 and at most kr^-^-^ choices for v^+i, • • • , ^£-2- To summarize, there are at most 

k{npn'y ■ r^-^-'^ ■ k/^^-^ < k^r^-^p'n^^ < kr^-^p> 

tuples in Tx;.j(i) with xy = fh, which establishes (ISSh . with room to spare. 

With the above estimate in hand, we are now ready to bound \Tj^ Recall that {vq, . . . , ve-2) G 
rsj(i) is removed, i.e., not in rsj(i + l), if e^+i G {fj+i, ■ ■ ■ , fe-2} or Cj+i G C/^.^^(i)U - • •UC/^_2(i), 
which is equivalent to {fj+i, ■ ■ ■ , fe-2} H Ce^^i (i) 7^ 0. In other words, such a tuple is removed if for 
some j + l<h<£ — 2we have fh = e-j+i or fh G Cej^i(i). Recall that whenever Tit holds, by ([7]) 
we have, say, |Cej_|^i(i)| < p^^n^. So, using that ([55ll gives an upper bound for the number of tuples 
in T^j{i) which contain fh, we deduce that 

\T^j{i)\ < (£+ |Ce,+i(i)|) • A;/"Vn^^ < k/-'^p>~^n^^' < kr^-^p> ■ n^^y{rp), 

which, with a similar reasoning as in (1540 . establishes (I53p for - {i) with j < i — 3. 

The case j = £—3. Recall that a tuple {vq, . . . , V£-2) G T^/sii) is removed, i.e., not in Ts,£_3(i+1), 
according to different rules. In the following we bound the total number of tuples removed in one 
step by each rule, which were called cases 1 and 2 in Section 14.2.31 In case 1 we have fe-2 = ei+i 
and so, given ej+i, using Ut we deduce that at most one tuple is removed under case 1. 

Turning to case 2, given ej+i = xy, note that a necessary condition for being removed by (R2) is 
that for some j G — 1] we have ft-2 £ Cx,y,T,{i-, j) or fn-2 £ Cy^x,T.{hj)- Recall that by Ut every 
such pair fi_2 is contained in at most one tuple in Ts/_3(i). So, since a tuple is only removed 
if the corresponding Cx,y,'T,{h3) or Cy^x,s{i,j) has size at most p^^n~^^^^, we deduce that at most 
2£ • p-'^fi-^oie f^ijpigg are removed in one step by (R2). 

Putting it all together, using p^^ = {npY~'^ and np < k, for j = ^ — 3 we obtain 

|T-^_3(i)| < 1 + 2^p-in-30fe < {npY-^n~^''' < k{npf-^n-^'''', 
which readily establishes the boundedness hypothesis (f53|) . 

5.3 Finishing the trajectory verification 

In this section we verify the remaining conditions of the differential equation method (Lemma 13. 3p . 

Initial conditions. Using (j36p . for j > we clearly have |rx;j(0)| = = Xj(0), which settles these 
cases. For the remaining case j = we crudely have 

|rs,o(0)| = iTsI = k^{r ± kn^^^^f-^ = (1 ± kn^^^' /rf-^k^/-^ C (1 ± o{l) / So)k'^r^-^ , 

which together with 2;o(0) = 1, 5*0 = k'^r^~^ and 13^ = 1 establishes (jl6p . 
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Bounded number of configurations and variables. Using k = u/60 and ()35p we obtain 
|C| < n • C^] • 3" • Yl Cl)^ n""^^^^"''' < e'^"'"' = 

which together with |V| < £ clearly establishes pTj) . 

Additional technical assumptions and the function fa{t). Using s = n?p as well as ([2]), 
(I2ip and ()35p . straightforward calculations show that (llSp holds, with room to spare; we leave the 
details to the reader. Recall that by ([2|) we have tmax = 'm/s = 0((log n)^/(^~^)). Furthermore, 



using (i36]) - (f38|l . elementary calculus yields x'f{t) = 0(t^i ^) and = 0{t^tl ^) for t < t 



Thus, since for all a = G C x V we have x„{t) = Xj{t) and y^{t) = x- {t), it follows that 

y^{t) = 0(log' n)<n' = \, and / 

0<i<m/s 



sup y^{t) = 0(log^ n) < = and / = 0(logn • log^ n) < A^. 

<t<m/s Jo 



Recall that for ah a G C x V we have K{t) = f'^{t)/2 and /^(t) = f{t)q{t)\ where /, G {0, . . . , £ - 3}. 
Hence, using /o-(O) = 1 = /3o-, we see that 

f,{t) = 2 /* K{t) dr + /,(0) = 2 r /i,(r) dr + 
JO Jo 

Note that h„{0) = 0(1) < n^^ = s^Xa and /i'^(t) > 0. Pick t* = t*{i) > 1 large enough such that 
for all t > t* we have t'^^ < f{t). Observe that h'^{t) is bounded by some constant for t < t* , and 
note that for larger t we have, say, h'^{t) < W^f{t) . Putting things together, using ([2]) and ([U, 
i.e., m/s = O(logn) and f{t) < n^, we readily obtain 

pm/s j-t* /•m/s 

/ \K{t)\dt< K{t)dt+ W'^fit)'^ dt<0{l) + 0{logn-n^^) <n^^ = sAa- 

Jo Jo Jt* 

To summarize, we showed that (I14p as well as the additional technical assumptions (I18p - (l20p hold, 
and this completes the proof of Lemma 14.21 □ 



6 A 'transfer theorem' for the H-free process 

In the H-iiee process there is a complicated dependency among the edges, and thus standard 
concentration inequalities are not directly applicable. In this section we show how to overcome 
this problem for decreasing properties by establishing a 'transfer theorem'. Roughly speaking, this 
allows us to 'transfer' results for decreasing properties from the binomial random graph model to the 
H-free process, at the cost of only slightly increasing the 'expected' edge density. In our argument 
this will be a crucial tool for establishing Lemma l4.3i 



6.1 Relating the H-iree process with the uniform random graph 

We start by relating the H-hee process with the more familiar uniform random graph. In the //-free 
process the set of open pairs 0{i) is defined in the obvious way: it contains all pairs xy G (^g') \ E{i) 
for which G{i) U {xy} remains H-fiee. The following estimate is not best possible, but it suffices 
for our purposes and keeps the formulas simple. 
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Lemma 6.1. Suppose Q is a decreasing graph property and that A = A(n) > 2 is a parameter. 
Then for every 1 < ^ < (2) /A, setting M = i\, we have 

P[G(i) ^ Q and \0{i)\ > n^X] < P[G„,m ^ Q] + e-'/\ (56) 
where G{i) denotes the graph produced by the H-free process after the first i steps. 

Proof. We sequentially generate the edges 61,62, .. ., where each edge 6j+i is chosen uniformly at 
random from E{Kn) \ {ei, 62, ... , ej}. On the one hand, the edge-set {61, 62, ... , 6m} clearly gives 
Gn,M- On the other hand, we obtain the graph produced by the H-free process by sequentially 
traversing the ej and only adding those edges which do not complete a copy of H. First, for every 
1 < j < M we define the indicator variable Xj for the event that ej is added to the graph of the 
H-free process, and, furthermore, define the random variable 

i<i'<i 

which counts the number of edges in the graph produced by the H-free process after traversing 
61, . . . , 6j. Next, for every 1 < j < M we define 

^_h. ,f|0(X.-.)|<„VA. „ 
[Xj, otherwise, ^p^^. 

If [©(X-'"^)! > n^/A holds, we have Yj = Xj by construction. In this case the next edge is added to 
the graph of the H-free process with probability at least \0{X^-^)\/ {'^) > 2/A. Otherwise Yj = 1 
holds, and so we conclude that ^[Yj = 1 | Yi, . . . , l^-i] > 2/A, which implies that Y^^ stochastically 
dominates a binomial random variable with M trials and success probability 2/A. With this in 
mind, standard Chernoff bounds, see e.g. ([9]) of Lemma 13. H give 

P[yA/ < 2i-t] < e-*'/(^*). (57) 

In the remainder we prove (|56p . To this end first observe that 

F[G{i) i Q and \0{i)\ > n'^/X] < ¥[G{i) i Q and X^'^ > i] + ¥[\0{i)\ > n^/X and X^^ < i]. (58) 

Note that by construction > i implies G{i) C Gn,M, and, since Q is a decreasing graph property, 
in this case G{i) ^ Q implies Gn,M ^ Q- It follows that 

P[G(i) ^ Q and X^' >i]< F[Gn,M ^ Q]- 

Furthermore, since 0{i) is decreasing, if both |0(i)| > n^/A and X'^'^ < i hold, then this implies 

P[|0(i)| > nVA and X^^ < i] < F[Y^^ < i] < 6"^/^ 
Substituting these bounds into ([58]) gives ([56]) . completing the proof. □ 

If we relax the additive error in Lemma l6.1l to o(l), then for |0(i)| > (2) /A a slight modification of 
the above proof works with M = iX + u!{l)X^/i; we leave these details to the interested reader. 
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6.2 A 'transfer theorem' for decreasing properties 

Using Theorem 12.11 and we see that |0(m)| > holds whp in the C^-free process. So, 

setting A = A(n) = n^/^ and using the 'asymptotic equivalence' of the uniform and the binomial 
random graph for monotone graph properties (see e.g. Section 1.4 of |12]). Lemma l6. II readily gives 
the next theorem. A similar idea is used in j26j for H = K4. Observe that the edge-density of G{m) 
is roughly 2ptjnax = 

G(p(logn)^/(^-i)) in the C^-free process. Intuitively, the following theorem thus 
states that for decreasing properties, G{m) is 'comparable' with the binomial random graph with 
only slightly larger edge density prf . 

Theorem 6.2 ('Transfer Theorem'). Define m = m{n) and p = p{n) as in ([2]). Suppose that e is 
chosen as in ([1]) and that Q is a decreasing graph property. Then for the Ci-free process we have 

P[G(m) ^ Q] < F[Gn,pn^ ^ Q] + 0(1). □ 

In fact, this result also holds for the H-fiee process, where H is strictly 2-balanced, if m, p and e are 
chosen as in Sections 1.2 and 1.3 of [3|, since then |0(m)| > n^~^/^, with room to spare. We believe 
that the above 'transfer theorem' will significantly aid in the future analysis of the //-free process, 
since for decreasing properties it often allows us to work with the much easier binomial random 
graph model, which has been extensively studied and for which e.g. sophisticated concentration 
inequalities are available. 



7 Properties of random graphs 

In this section we introduce several decreasing graph properties, which are key ingredients in our 
proof of Lemma 14.31 Using the 'transfer theorem' of Section [6l it suffices to prove that they hold 
whp for the binomial random graph Gny with p' = pn^, where p is defined as in ([2]) and e is chosen 
as in ([T|). We remark that essentially all results in this section are not best possible, but suffice for 
our purposes. For example, in an attempt to keep the formulas simple, we have not optimized the 
multiplicative factors involved (their contribution in our later arguments will be negligible). 

7.1 Basic properties 

Lemma 7.1. Let J\f denote the event that for all pairs of distinct vertices x,y € [n] we have 
\T{x) n r(y)| < 9. Then N holds whp in Gn^. 

Proof Using ^ > 4, (P) and ([2]), i.e., p = n'^+^/^^^i) < n^^/s and e < 1/20, we deduce that 

PhAA] < Q ("-^~ ^) ipn'f" < n\npWf^ < n2(n-i/3+2e^io ^ ^^^^^ 
as claimed. □ 

The following result states that every set of size at most u contains a large independent subset. A 
similar argument was used by Bollobas and Riordan in [B|. 

Lemma 7.2. Let I denote the event that for every U C [n] with \ U\ < u there exists an independent 
set S C U with \S\ > \U\/6. Then I holds whp in Gn,p'- 
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Proof. Let £ denote the event that every U C [n] with < n spans less than 3\U\ edges. We have 

E (:) (2) s E (f)^(f)"(p"=)^'£E(™•V»-)^ 

1<X<U ^ ^ ^ ^ 1<X<M X>1 

Using ^ > 4, ([I]), ([2]) and dHJ, i.e., u < npn", p = n"i+V(^-i) < ^"2/3 g^^^^j ^ < j^/gQ^ 

which implies P[-ii?] = o(l). Suppose that f holds. Then every set of at most u vertices induces a 
graph with minimum degree less than six. Given U C [n] with \U\ < u, we set W = U. Now, by 
iteratively selecting a vertex v £ W with at most five neighbours in and removing {v} U T{v) 

from W, we obtain an independent set with at least \U\/6 vertices, and the proof is complete. □ 

7.2 Bounding the numbers of certain paths 

The results in this section give estimates for the numbers of certain paths. Their statements will 
contain certain exceptions, and, as we shall see, many of these complications are in fact necessary. 

7.2.1 Preliminaries: the size of certain neighbourhoods 

The following crude upper bound on the degree of every vertex readily follows from standard 
Chernoff bounds (Lemma 13. ip - we omit the straightforward details. 

Lemma 7.3. Let D denote the event that for every v € [n] we have |r(t')| < npn'^^ . Then D holds 
whp in Gn^pi ■ □ 

With similar reasoning it is also not difficult to see that whp for all large sets 5, in Gn^p' we have, 
say, |r(5')| > |5'|np, which is much larger than 15"!. Intuitively, the next lemma thus implies that for 
most reasonable sized A [n], only a small proportion of T{S) is contained in N^-^~'^\A., S U A). 

Lemma 7.4. Let Ai denote the event that for all disjoint yl, C [n] with \ A\,\S\ < kn^^ we have 

e{S, N'^^^~^\A,SuA)) < kn^^'. (59) 

Then Ai holds whp in Gn^p' ■ 

Proof. Let ^' contain all pairs {A,S) with disjoint A, S* C [n] satisfying l^l,!^! < kn^^ . Given 
%l) = {A,S) e ^, let M.^ denote the event that ([Ml) holds, and let y^p contain all y C ^ u 
Ui<d<e-3 VdiSuA) with \Y\ < (npn^^y-^n^^ . Given tp = {A, S) £ and Y G y^, let denote 
the event that iV(^^-3) ^-^^ SUA) = Y. Using k < npn^ , it is not difficult to see that whenever D 
holds, then for every ip £ some with Y G y^ holds. Furthermore, clearly implies that 

some with ■0 G ^' fails. So, we obtain 

¥[-.M\<¥[-.V]+ ^ PhA^^nAA^,y]. 

Note that for every -0 = (A, 5) G ^' the events are mutually exclusive. So, using l^*] < tl^^^^" 

and that T> holds whp by Lemma 17.31 to finish the proof it is enough to show that for every 
■0 = (^, 5) G ^' and 1" G 3^^ we have 

PhTW^ I M^y] < n-^('="''). (60) 
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Observe that we can find Y = N^-^^'^^A, 5 U A) by starting with N^°\A, S U A) = A, and then 
iteratively testing vertices in Va{S U A) to see whether they are adjacent to N^'^~^\A, S U A), up 
to d = £ — 3. Since S is disjoint from A and all Vd{S U A) with 1 < d < i — 3, this exploration has 
not revealed any pairs between S and Y. We deduce that, conditioned on M^y, all edges between 
S and Y = N^-^'^\A, S U A) are included independently with probability p' = p-nf . Now, using 
{npY~^ = p~^ and £ > 4, the expected number of these edges is bounded by 

\S\ ■ \Y\ -p' < kn^'- ■ {npn^'Y-^n^' ■ pn' = kn^^^+''^' < kn^^^~^^' . 

Thus standard Chernoff bounds, see e.g. (fTO]l of Lemma [3?T| imply (f60]l . completing the proof. □ 

7.2.2 Paths ending in the neighbourhood of another set 

We start with a technical lemma, which will be used in the subsequent proofs of Lemmas 17. 61 and 17. 81 

Lemma 7.5. Let Qi denote the event that for all v G [n] and A,X CI [n] with A <^ X and 
\A\, \X\ < kn^^^ , for every 2 < j < £ — 1 and < d < £ — 3 there are at most at most {npy~^n^^^ 
vertices w £ N^-'^^ {A, X) for which there exists a path 

v = wo---Wj = w with {wo,...,Wj-i}nN^-'^'>{A,X) = (/}. (61) 

Then Qi holds whp in Gn.p' ■ 

Proof. Let ^ contain all tuples (u, A, X,j, d) with v e [n], A, X Q [n], 2 < j < £-1 and <d<£-3 
satisfying A C X and |^|, \X\ < kn^^^ . Given il) = {v,A,X,j,d) G ^, by we denote the event 
that there are at most (npy~^n^^^ vertices w E N^-^\A, X) for which there exists a path satisfying 
(j6ip . Clearly, -iQi implies that some with ^ G ^ fails. 

Next, given ijj = {v,A,X,j,d) £ we denote by the set of pairs {Y, Z) with Y O AU 
Ui<d'<d^'(^) ^ — ['^IV^ satisfying \Y\ < {npn'^^Y'^^n^^^ and \Z\ < {npn?^y~^ . Furthermore, 
for every Y C [n] and v G [n] we inductively define 

r(o)(^,y) = {x;}\y and T^'+^\v,Y) = T{T^^ {v,Y))\Y. (62) 

Given ijj = {v,A,X,j,d) G ^' and </> = {Y, Z) £ y^, let Af^^^ be the event that N'^^'^\A,X) = Y 
and r(j-i)(t;,y) = Z. Whenever V holds, using k < npn^ it is easy to see that for every ^/^ G \I' 
some M^^tf) with (p G y^ holds. Putting things together, we obtain 

i^={v,A,x,j,d)€<f <p=(y,z)€y^ 

Since T? holds whp by Lemma [7.3( using |^| < ji^kn''^^ that for every ^ G ^' the events A/'^,^ are 
mutually exclusive, to complete the proof it suffices to show that for every ijj = {v, A, X, j,d) G ^' 
and (p = {Y, Z) G y^ we have 

PhQ^|AA^,0]<n-"^'"'''). (63) 

Recall that on A/"^.,/, we have Y = N^^'^\A, X) and Z = T^^-^\v, Y). Every li; G F for which there 
exists a path satisfying ([6T]) is contained in T{Z), and so whenever fails we deduce \T{Z) ny| > 
(npy~^n^^'^ , which in turn implies 

e{Y,Z)>{npy~^n^^'. (64) 
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Next we analyse the distribution of the edges between Y and Z conditional on M^^^. We can 
iteratively determine Y = N^-'^\A^X) as in the proof of Lemma 17.41 Then, given Y, we can 
similarly find Z = T'^^-^\v,Y); by ^ this can clearly be done without testing any pairs between 

Y and Z. It certainly can happen that during the first exploration, i.e., when determining y, we 
have already revealed some pairs between Y and Z, consider e.g. the case where Z r\ Vi{X) ^ 0. 
However, by construction all such pairs are non-edges. Therefore the number of edges between 

Y and Z is stochastically dominated by a binomial distribution with \Y\ ■ \Z\ trials and success 
probability p' = prf . Using d < £ — 3 and j < £ — 1 as well as [np)'^^^ < {npY^"^ = p^^, the 
expected value of the corresponding binomial random variable is at most 

\Y\ ■ \Z\ -p' < {npn^'f+^n^^' • {npn^y-^ ■ pn' < ^npy-^n^5i+2d+2j+i)e < ^^^^j_i^{9^-i)£_ 

So, since j > 2 and k < npn'^, standard Chernoff bounds show that (I64p holds with probability at 
most e"'^"'^*^, see e.g. (fTO]l of Lemma [HJI This establishes (i63]l and thus completes the proof. □ 

Given a vertex v £ [n], we expect that roughly {np'Y~'^ vertices w G [n] are endpoints of a path 

Y = wq - ■ ■ W£-2 = w. Loosely speaking, the next lemma states that there are significantly fewer 
such vertices w if we only count endpoints in a certain restricted set and forbid some exceptional 
paths. For the argument of Section [8] it is important to observe that Vi is monotone decreasing. 

Lemma 7.6. Let Vi denote the event that for all disjoint A,S [n] with \A\, \S\ < k there exists 
X C [n] with \X\ < kn^^^ , such that for every v £ S there are at most {npY~^n^^^^ vertices 
w G A^(^^3) (A, X) for which there exists a path 

V = wq - ■ ■ W£_2 = w with wi ^ A. (65) 

Then Vi holds whp in Gn,p' ■ 

Proof. By Lemmas 17.31 17.4] and 17.51 the event DOAinQi holds whp. In the following we are going 
to argue that for every graph G satisfying those properties, Vi holds as well. As this claim is purely 
deterministic, it suffices to prove it for fixed disjoint A,SC [n] with |^|, |5| < k. By Ai there are at 
most kn'^^'^ edges between S and N^-^~^\A, S U A). Let Vs^a contain the endpoints of those edges 
and define 

X = AUSU Vs,A- (66) 

Note that |X| < kn^^^ . Given v £ [n], by we denote the set of w G N^^ ^\A, X) for which there 
exists a path satisfying ([65]). To finish the proof, it suffices to show that for every v £ S we have 

\Wy\ < (np)^-3^iofe_ (g7) 

Fix V £ S X . Since S D A = for every path v = wo - ■ ■ W£-2 = w with w £ N^^~^\A, X) there 
exists 1 < j < ^ — 2 such that 

{wo,...,Wj^i}nN^^^~^\A,X) =$ and wj £ N^^^~^\A,X). (68) 

Recall that by assumption wi ^ A. So, by (I24p and (I66p we may restrict our attention to the case 
j > 2, since S has no neighbours in N^-^~^\A,X) \ A. Now, as Qi holds, considering d ■(^ £ — 3, 
for every 2 < j < £ — 2 we deduce that there are at most {npy~^n^^^ vertices wj £ N^-^~^\A, X) 
for which there exists a path v = wq ■ ■ ■ Wj satisfying (|68l) . Recall that the degree of every vertex 
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Figure 5: Examples of (2,2)-paths for i = 5. As usual, solid lines represent edges; for the other 
pairs there are no restrictions. Note that wi may be in j4 U 5 or the vertex classes Vi U V2. 



is at most npn'^^ by P. So, given wj, there are at most {npv?^Y ^ ^ vertices w G N^^ '^\A,X) for 
which there exists a path Wj • • • u)^_2 = w. Putting things together, we deduce that 

2<i<^-2 

As explained, this implies Vi, and the proof is complete. □ 

Note that in Lemma 17.61 a condition of the form wi ^ A is necessary. Indeed, standard Chernoff 
bounds imply that whp every vertex has degree Q{np'). Furthermore, e.g. with a similar argument 
as in the proof of Lemma 10.6 in [5], one can show that whp for all choices of A, S, X, for all Z C ^ 
with \Z\ > np we have, say, \N^^~^\Z, X)\ > \Z\{npY~^ > {npY~'^ ■ So, by picking A G (^^^) such 
that it contains at least np = o{k) neighbours of some vertex v* , we have at least {npY~'^ vertices 
w G N^^~^\A, X) which are endpoints of paths v* = wq ■ ■ ■wi-2 = w with wi G A, violating the 
claimed bound. 



7.2.3 Paths connecting two sets 

Given A,B,X C [n], for every j > 1 and < d < ^ — 3, we say that wq - ■ ■ Wj = f d • • • f is a 
{j,d)-path wrt. {A,B,X) if vq £ A, wq e B and Vd' G Vd'{X) for all I < d' < d, cf. Figured 
Intuitively, the next technical result states that the number of {j, (i)-paths is not 'too large' if we 
allow for deleting a few edges. 

Lemma 7.7. Let Q2 denote the event that for all A,B CI [n] with \A\, \B\ < k there exists F C ('^^^ 
with \F\ < kn^"^ , such that for every 1 < j < i — 1 and < d < £ — 4 the number of {j,d)-paths 
wrt. {A,B,AUB) that are edge disjoint from F is bounded by k'^{npy~^n^^'^ . Then Q2 holds whp 
in Gn,p' ■ 

Proof. Fix A,BC1 [n] with |^|, \B\ < k. Given j and d, we denote by Sj_d = Sj^di^, B) the family of 
edge-sets of ah possible (j, d)-paths wrt. {A,B,AVJB). Clearly, \Vd'{AVJB)\ < n for ah 1 < d' < d. 
So, using p = {np)~^^~'^\ j < ^ — 1 and d < i — 4, the expected number fij^d of such (j, (i)-paths 
satisfies 

t^j,d < k\^+''~\pn'Y+^ < k\npy+''~W'' = k\npy+''+^-'n^'' < k\npy-^n^'' . 

Set Kj = k'^{npy~^n^^^ and h = kn^ . Using the Deletion Lemma (cf. Lemma l3.2p the probability 
that VC{b, Kj,Sj^d) fails for some 1 < j < I — I and 0<d<£ — 4is bounded by 

i<i<^ o<d<e-4 
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with room to spare. Whenever 'DC{b, Hj^Sj^d) holds, we denote by Fj ii the corresponding 'ignored' 
edge set £"0 as in Lemma [3.2i If all 'DC{b, Kj,Sj^d) with I < j < i — 1 and < d < £ — 4 hold 
simultaneously, then defining F as the union of all edge sets Fj^^ has the required properties. Finally, 
taking the union bound over all choices of A and B completes the proof. □ 

For most large sets B and W, we expect that the number of {b,w) ^ B x W for which there exists 
a path b = wq - ■ ■ = w should be roughly |i?||Vl^|n^~^p'^~^ = \B\\W\n^^~'^^^ /{np). Loosely 
speaking, the next lemma suggests that for most reasonable sized A,B CI [n], this upper bound 
holds for W = N^^~^\A, X) if we forbid certain exceptional paths, as in this case \W\ ~ \A\(np'Y~^. 

Lemma 7.8. Let V2 denote the event that for all disjoint A,BC1 [n] with \A\,\B\ < k there 
exists X C [n] and F C (f^') with \X\ < kn^^^ and \F\ < kn^^ , such that the number of pairs 
(6, w) G B X A^(^^4) (A, X) for which there exists a path b = wq - ■ ■ Wi-2 = w with 

wi ^ A and {w2 ^ A or {wqWi,wiW2} C\ F = $^ (69) 
is at most k'^{npY~^n^^^^ . Then V2 holds whp in Gn,p'- 
Before turning to the proof, note that V2 is monotone decreasing. 

Proof of Lemma \7.8[ By Lemmas 17. 3| 17.41 17.51 and 17.71 it is enough to show that V2 holds for every 
graph G satisfying V Ci M. Ci Qi Ci Q2. As this claim is purely deterministic, it suffices to prove it for 
fixed disjoint A,B c: [n] with \A\, \B\ < k. Given X C [n] and F C (M), we denote by Pj^d{X,F) 
the set of (j, (i)-paths wrt. {A,B,X) that are edge disjoint from F. By Q2 there exists F C (t^^) 
with |F| < fcn^^ such that for all 1 < j < £ - 2 and < d < ^ - 4 we have 

\Pj,d{A U B,F)\ < k\npy-\'^^'. (70) 

Let Vp contain all vertices outside A that are endpoints of edges in F. Note that \Vp\ < 2kn'^^. 
Considering S ■'^ BUVpjhj M there are at most fcn^^^ edges between BUVp and N^-^~^\A, B U 
VpD A). Let Vb,f contain the endpoints of all those edges and set 

X = AuBUVpU Vb,f- (71) 

Observe that, say, |X| < kn^^'^ . Furthermore, using (j24p we see that 

VfH (J V;(X) = and T{Vf ^ B) r\ {N^^^'-^\A,X)\ A) = il). (72) 

l<ft<£-4 

For every 1 < j < ^ — 2 we define Wj as the set of all pairs {b,y) € B X A^(^^-4)(A,X) for which 
there exists a path b = wq ■ ■ ■ Wj = y satisfying ([69]) and 

{wo,...,Wj^i}nN'^^^~^\A,X) = ^ and wj e N^^^-^\A,X). (73) 

We claim that in order to complete the proof, it suffices to show that for all 1 < j < £ — 2 we have 

\Wj\< k'^{npy~\^^^'. (74) 

Indeed, let W contain all pairs (b, w) & B x A^(^^^) (A, X) for which there exists a path b = 
wq - ■ ■ Wi-2 = w satisfying ([69]) . Note that for every such b = wq - ■ ■ u)f_2 = w there exists 1 < j < 
1 — 2 such that b = wq ■ ■ ■ wj satisfies ([73]) . Recall that by V the degree is bounded by npn^^ . So, 
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given Wj, there are at most {npv?^Y ^ ^ vertices w £ A^^^ ^\A,X) for which there exists a path 
Wj ■ ■ ■ W£_2 = w. Putting things together, assuming ([7^ we obtain 

\W\ < l^il • {npn^'Y-^'^ < k\npY-\^^^', 

i<i<^-2 

and so 1^2 holds, as claimed. 

We shall now prove (j74p . Observe that for j = 1 we need to consider paths wqWi with wq & B 
and wi G N^^^-^\A,X) \ A. Now, using the second part of ((72]) we see that wi G r(^i;o) n 
{N^-^~^\A, X) \ A) is impossible. This implies \Wi\ = 0, which clearly establishes ()74p for j = 1- 

For j > 2 we first consider H^j,F ^ W^j , which contains all pairs (b, y) G Wj for which there exists a 
path b = Wq ■ ■ ■ Wj = y satisfying (f73]l and 

{wQtfi, • • • , tfj-iWj} n F = 0. (75) 

Clearly, for every (6, y) G there exists < d < £ — 4 such that at least one (j, (i)-path wrt. 

(A, with b = wq and ti;^ = y satisfies (|75]) . We claim that the corresponding (j, (i)-path 

WQ-'-Wj = V(i---vo is edge-disjoint from F, i.e., contained in Pj^rf(X, F). To see this, observe 
that every / G • • • ,viVo} n F has at least one vertex outside of A, say G Vk{X) with 

1 < K < (i, which contradicts ()72p . since by construction ij^ G Vp. In addition, by ()24p and (j7ip we 
see that Pj_rf(X, F) C Pj^diA U B,F). Putting things together, using (f70]l our discussion yields 

\Wj,F\< \PjAX,F)\< Y1 \PiAA^B,F)\<e{npy-\''''. (76) 

0<d<£-4 0<d<e-4: 

It remains to estimate the number of pairs in Wjp = Wj \ Wj^p, where the corresponding paths 
intersect with F. We start with the special case j = 2, i.e., paths b = woWiW2 = y with (6, y) G W2 p 
satisfying (f69l) . Observe that every / G {woWi,wiW2} n F contains G Vp, since ^ ^ by (f69]l . 
Note that W2 e A contradicts the second part of ([69]), and that W2 G T{wi) D {N'-^^-'^\A, X) \ A) 
is impossible by ([72]) . To sum up, |W2*j7'| = 0, which together with (f76]l implies (f74|l for j = 2. 

Turning to j > 3, for every 1 < ? < j we denote by W*p^ C W*p the set of pairs (6, y) G W'j*^ with 
y ^ A where the corresponding path b = wq ■ ■ ■ Wj = y satisfies Wc;-iWc; G F and ([73]) . We claim 
that it is enough to show that for every 1 < ? < j we have 

\W*pJ < e{npy-\^^'. (77) 

Indeed, since there are at most \B\ ■ \ A\ < k'^ < k^{npy~^ pairs (6, y) G W*p with y £ A, we obtain 

\WIp\ < k\npy-^+ I^IfJ < k\npy-^n^^^, 
i<?<i 

which together with (I76p establishes (I74p . as claimed. 

In the following we verify (fTTl) . First we show that iM/^j?^! = for <^ G {j — If Wj-iWj G F, 

then TUj ^ A implies G Vp, but the remaining possibility Wj G A^'^-^^^)(A, X) \ j4 contradicts 
(I72]l . If Wj_2Wj-i G F, then by ([73]) we have Wj-i ^ iV(-^-^)(yl, X) and so Wj^i G Vp. Since by 
assumption Wj ^ A we must have Wj G r(t(;j_i) n {N^-^~^\A, X)\A), which is impossible by (f72]l . 

Now, suppose that Wi;-iWc; G F with 1 < ? < j — 2. Considering v ^ w^ and d ^ — 4, by 
Qi there are at most (npy~'-~^n^^^ vertices Wj G N^-^~^\A, X) for which there exists a path 
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Wc; = w'q ■ ■ ■ w'j__^ = Wj with {w^, . . . , Wj^i} n N^-^ ^H^; ^) = 0- So, using |F| < A;n^^, since there 
are at most \B\ = k choices for b £ B, for ? > 2 we deduce that 

\W*pJ < \B\ • 2\F\ • [npy-'-^n^^' < k^{npy-'-\''^^+^'^' < k'^{npy-^n^^' , 

as claimed. Note that for the remaining case <; = 1 each (ordered) edge wqWi G F also determines 
the vertex b = wq £ B. So, compared to the estimate above we win a factor of \B\, and a virtually 
identical calculation yields that ()77p also holds in this case, which completes the proof. □ 

With very similar reasoning as for Lemma 17.61 one can argue that an extra condition for the case 
W2 G A is needed in Lemma 17.81 this time we can otherwise violate the claimed bound whp by 
fixing some vertex v* and then choosing disjoint A,B CI [n] such that each contains at least np 
vertices from r(t'*); we leave the details to the interested reader. 

8 Very good configurations exist 

In this section we prove Lemma Given a graph property y, let denote the event G{i) £ y, 
i.e., that G{i) satisfies y. Now, for every < -i < m we set 

^Vi = I^nlC^nCinM^n Vi., n V2,i n 

where /Cj, £«, % are defined as in Theorem 12.11 and Lemma [2^ and X, AA, Vi, Vi are defined as 
in Lemmas 17.11 17.21 17.61 and 17.81 It is not difficult to see that Wi is monotone decreasing and, 
using the 'transfer theorem' (Theorem 16. 2p . that Wm holds whp. Observe that by monotonicity 
Wm implies Wj for every i < m, and that ^Bi(Ti) implies -^Bi-i(Ti). So, to complete the proof it 
suffices to consider fixed G{i) satisfying Wi and show that for every {v,U) with U e ('"^i/^^) there 
exists S* = {v,U, A, B, R) G C satisfying -ii3j(E*) and (|29l) . In fact, since the above claim is purely 
deterministic, it is enough to also consider fixed {v,U). Our proof proceeds in several steps and 
we tacitly assume that n is sufficiently large whenever necessary. First, in Section [8.11 we choose a 
'special' configuration S* = (v, U, A, B, R) and collect some of its basic properties. In the remaining 
sections we verify that S* has the properties claimed by Lemma 14.31 More precisely, in Section [8.21 
we show that -ii3i(S*) holds, and in Section [8.31 we establish ()29p . 

8.1 Finding S* = {d, U, A, B, R) 

In the following we show how we pick S* = {v, U, A, B, R). Along the way, we furthermore collect 
some immediate properties of the resulting S*. We set 

r = m and 'd = 20^t = 800 (78) 

For the main steps of our argument it is useful to keep in mind that "& ^ t ^ £ and i?e <^ 
First, we choose S Q U such that 

S is an independent set and |5| > n/6, (79) 

which is possible since Xj holds. Henceforth we assume that vi, . . . ,Vn G [n] are ordered so that 

|r(t;i) n 5| > |r(?;2) n s| > • • • > \r{vj) n 5| > • • • > \r{vn) n s\. (80) 
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We greedily choose first Ia, and afterwards Ib, such that they are the smahest indices for which 

Na= \J {r{vj)ns) and Nb= \J {r{vj)ns)\NA 

each have cardinahty at least 2k, where we set the corresponding index to oo if this is not possible. 
Recall that /c = 7/6O • npt^anx by ([22]) and 7 > 180 by ([2T]) . So, since % holds, by ([6]) the maximum 
degree is at most Snptmax < k. Using k = n/60, we deduce that 

lA^AUiVfil < 6A; < li/lO. (81) 

8.1.1 Picking yl, 5 

If Ib = 00 or £b > n^^^ , we choose arbitrary disjoint sets, each of size k = n/60, satisfying 

A,B<ZS\{NAyjNB), 
which is possible by (j79p and (j8ip . For later usage, we furthermore set = and Ib = 0- 

If tB < n^'^'^ = o{k), we set Ia = {fi, • • • ,ve^} and = {v^^+i, . . . ,veg}. Since satisfies A/i, 
the codegrees are all bounded by nine, and thus 

\r{iB) n Na\ < \r{iB) n r{iA)\ <9-eB-£A< 9n^''' = o{k). (82) 

Now we choose arbitrary sets, each of size k, satisfying 

A<ZNa\{Ib^ T{Ib)) and B C Nb, 
which is possible by ([82]) . Clearly, A and BU Ib are disjoint. 

Next we estimate the size of certain neighbourhoods. A similar argument can be found in [24] . 
Lemma 8.1. We have r(/^)n5 = and T{lB)r\A = 0. Given Y G {A,B}, every v ^ ly satisfies 

\T{v) n y| < npn-^^. (83) 

Proof. 1{ ^B = oo, then all vertices v G [n] satisfy the stronger bound |r(?;) Ci {AU B)\ =0. 

Next, we consider the case n^'^^ < £b < oo, where Ia = Ib = 0- Since all vertices v G {vi, . . . , vgg} 
satisfy \T{v) n {AU B)\ = 0, using ([80]) it is not difficult to see that in order to prove ([83]) . it suffices 
to show |r(fj;) n 5| < npn~^'^ for x = n^'^'^ . Set H = {vi, . . . ,Vx}- On the one hand, using ([80]) 
we have 2e{H, S) > x\r{vx) H 5|. On the other hand, since G{i) satisfies /Cj, using \II\ = n"^^^ and 
|5| < npn^, we have, say, e{H,S) < npn^'^ . So, we deduce |r(f^) n 5| < npn~^^, as claimed. 

Finally, suppose that £b < n?^'^. Observe that r(/^)ni? = and r{lB)r\A = hold by construction. 
Fix Y G {A, B}. Since by Mi all codegrees are at most nine, for every v ^ ly have \T{v) r\Y\ < 
\T{v) n r(/y)| < 9£bi which readily establishes ([83]) . and thus completes the proof. □ 

8.1.2 Choosing R 

Observe that |/_b| < v?'^'^ . Considering A and S <— Ib, denote by Xi the set X whose existence is 
guaranteed by Vi^i. Similarly, let X2 and F denote the sets X and F whose existence is guaranteed 
by 7^2,1 when considering A and B. We have |^i|, \X2\ < fcn^^^ and \F\ < kn^'^ . Now we set 

R = {v]{JU\JXi\JX2. (84) 
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Clearly, \R\ < kn}^^'^ holds, with room to spare. Next we collect several structural properties. By 
dsn and dMD and have N^^\A,R) C N^i){A,Xi) n iV(j)(A, X2). So, using {T{Ib) U /b) n ^ = 0, 
we immediately obtain the following statement: 

Lemma 8.2. We have \Ib\ < 'n?^^ , and for every v ^ Ib there are at most (npY~^n^^^^ vertices 
w G N^^^^\A, R) for which there exists a path v = wq - ■ ■ wi-2 = w. □ 

In addition, using that A U i3 is an independent set, we readily deduce the following result: 

Lemma 8.3. We have \F\ < kn?"^ , and there are at most k'^(npY^^v}^^^ pairs {b,w) G i? x 
jyii-i) ^j^^ j^-^ which there exists a path b = wq - ■ ■ we-2 = w satisfying W2 ^ A or {wqWi , W1W2} D 
F = 0. □ 

In the subsequent sections, the construction of A and B is irrelevant; all that we use is that A, B 
are disjoint subsets of U with size k, and there are sets F, 7^, R such that the conclusions of 
Lemmas EIHO hold in 

8.2 The configuration S* is good 

In this section we show that = -i^i^j(S*) n -i^2,i(S*) holds. 

8.2.1 The bad event i3i,i(E*) 

In order to prove that fails, using Lemma 18.31 it suffices to show that there are at most 

k'^{npY~'^n~^^^ paths wq ■ ■ ■ Wi-2 with {wq,W2) ^ B x A satisfying wqWi G F or W1W2 G F. Let 
Ps* denote all such paths. For every wqWi £ F (1 E{i) with vjq G B, using Lemma ET] we see that 
wi ^ which by (|83p implies that there are at most npn~'^^ choices for W2 G r(?i;i) n A. With 
a similar argument, for every W1W2 G -F H E{i) with W2 £ A we have at most npn~^^ choices for 
Wq G T{wi) DB. Furthermore, since the degree is bounded by npn^, given W2 € A there are at most 
{npn'^Y^^ paths W2 - ■ ■ we^2- So, using np < k, \F\ < kn^'^ and (|78]l . i.e., > 20^, we deduce that 

IPs* I < npn-^' ■ 2\F\ ■ inpn^Y'^ < k^{npY~^n^^~^'^^ < k^{npY~\-^°' , 
which, as explained, establishes -^Bi^i{Y!,*). 

8.2.2 The bad event S2,i(S*) 

In anticipation of the estimates in Section 18.31 here we analyse the combinatorial structure of 
L's*{i) much more precisely than needed. To this end we introduce the sets LY,*{i,j), where for 
every j £ [i — 1] we denote by LY,*{i,j) the set of all ordered pairs xy with distinct x,y £ [n] such 
that \Cx,y,j:*{i, j)\ > p^^n^^^^^ . We start by showing that we may restrict our attention to the case 
j G {1,2}. Recall that Cx.y,T,*{i,j) contains all pairs bw £ B x N^^~^\A, R) for which there exist 
disjoint paths b = wi ■ ■ ■ Wj = x and y = Wj+i ■ ■ ■ = w in G{i). Fix x ^ y. Since the degree is at 
most npn^ by ([6]), for j > 3 the number of choices for w is at most {npn^Y ''^^ — {nprfY~^- Now, 
as there are at most \B\ < k < npn'^ ways to pick b £ B, using (npY""^ = p'"^ we crudely have 

|C^,y,E-(i,j)| < npn^ ■ (npn^Y'^ < p"^n^7(np) < p-^n'^^^' , (85) 
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which imphes xy ^ L^*{i,j)- Therefore L-£*{i,j) = for j > 3, so 

|Ls.«| < |Ls*(i,l)| + |Ls-(^,2)|. (86) 

With foresight, for all j > 1 we define M^^\A) as the set of t> G [n] with A)| > {npyn~'^^^ 

where VF^-') (w, A) contains all vertices w G A^(^^^) (yl, i?) for which there exists a path v = wq ■ ■ ■ wj = 
w in G(i). Now we claim that 

iE*(i,2) C {xy : X E /b A y e M(^-3)(A)} . (87) 

Note that Cx,y,T,*{h'^) contains only pairs bw £ B x N^^^^\A, R) for which there exists paths 
b = 71)11X12 = X and y = ws ■ ■ • W£ = w in G{i). First suppose that x ^ Is- Using Lemma \8A] by 
([83]) we have at most npn~^^ choices for b G r(x) n B. Since the degree is at most npn'^ , we have 
at most {npn^Y~^ choices for w. So, using we deduce that 

|C^,s/,s*(«,2)| < npn-^^ ■ {npn' f"^ < p-^n^^~^^' < p'^n-^^^^, 

which implies xy ^ L-£*{i,2). Next, we consider the case where y ^ M^^^^\A). With a very similar 
reasoning as above, this time using |14^(^~^)(y, ^)| < {npY~^n~'^^ and (j78p . i.e., r = 40£, we obtain 

|<^x,y,s*(«,2)| < npn^ ■ {npY~^n~^^ < p-^^^-^'^' < p-^n-^o^^ 

which implies xy ^ L-s*{i, 2). This completes the proof of ([87|) . 

By a similar but simpler argument we furthermore see that 

L^*{i,l) ^ [xy : xe B A y£ M(^-2)(^)| . (88) 

Next we estimate the cardinality of M^^\A). A similar argument is implicit in [3]. 
Lemma 8.4. For every l<j<£ — 2we have \M^^\A)\ < {npY~'^~^n'^^'^^ . 

Proof. Set ij(°)(yl) = A^(^-3)(yl,ii), and for every j > 1 we let H'^i\A) contain all v G [n] with 
\T{v) n H^'-'^'^\A)\ > npn~'^'^^ . First, we claim that for all 1 < j < ^ — 2 we have 

M^'j\A)<^H'^i\A). (89) 

Since r > 2£ by ()78p . it clearly suffices to show that for all 1 < j < ^ — 2, for every v ^ H^^\A) 
we have |l^'^-'''(v, ^)| < j{npn^yn~'^'^^ . We proceed by induction on j. For the base case j = 1 the 
claim is trivial, since H^^\A) contains all vertices v G [n] with \T{v) D N^^~^\A, R)\ > npn~'^'^^ . 
Turning to j > 2, fix u ^ H'^^\A). By distinguishing between the neighbours of v inside and outside 
of H^i-^\A), using the induction hypothesis and that the degree is bounded by nprf , we obtain 

\W^^\v,A)\ < npn"'^^'' ■ {npn^y^^ + nprf ■ (j - l)(npn^)-'~^n"^^^ < j{npn''yn~'^'"' , 

which, as explained, establishes ()89p . 

To finish the proof, again using r > 2i, it suffices to show that for all < j < ^ — 2 we have 

|i/(j')(^)| < (np)^~2-i^(2ir+^+j>_ (gQ) 

As before, we proceed by induction on j. Using |A| < A; < npn'^ and that the degree is bounded 
by npn^ ^ we establish the base case j = by observing that |i?(^^(^)| < |r(^~^)(^)| < {npn^Y~'^. 
Suppose j > 1. Recall that {npY'"^ = p~^- Since holds, using the induction hypothesis we obtain 

\H^^HA)\ < 16e-l(np)^-2-,^(2ir+£+,-l). < ^^^Y-2~,^^2jr+i+j)e^ 

completing the proof. □ 
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With Lemma 18.41 in hand, combing (j86p - (j88p with \B\ = k < npn^ as weh as \Ib\ < n^'''^, and then 
using ([1]), ([75]) as well as £ > 4, np = n^/^^^^^ and (np)^ < {npY'"^ = p~^, we deduce that 

\L^,{i)\ < npn' ■ r?'^' + n^""' ■ npn^'^' < npn''''' < {npfn-^'^^'^ < p~^n'^l^^'\ 

which establishes -1^2,1 (S*). 

8.3 Few tuples are ignored for S* 

In this section we estimate the size of Tx;*^£_3(z) \ Zx;*^£_3(i). Let QT,*{i) contain all pairs {wi,W() G 
B X N^^~'^\A, R) for which there exists a path wi - ■ ■ we with W2 G Is U M^^-^\A). We claim that 

|Ts.,^-3(i)\^s*,^-3WI < |Qs*(i)|. (91) 

Every tuple (uq, • • • ,^^£-2) £ ^s*,£-3(^) \ -^E*/-3(^) was ignored in one of the first i steps because 
(R2) failed. Recall that Cx,y,T.*{i, j) contains all pairs bw £ B x N^^~'^\A, R) for which there exist 
disjoint paths b = wi ■ ■ ■ wj = x and y = Wj+i ■ ■ ■ W£ = w in G{i). Observe that for every ignored 
tuple there exists i' < i, distinct x,y £ [n] and j £ [i — 1] with e^'+i = xy, fe-2 £ Cx,y,T,{i' ■,]) 
and \Cx,y,Y,{i\ j)\ > p^^n~^^^^ . So, since e^'+i = xy was added, for every such tuple there exists 
a path Vi-2 = wi - ■ ■ WjWj+i ■ ■ - we = ves with Wj = x and Wj+i = y in G{i' + 1) C G{i). Note 
that by monotonicity we have Cx^y,Y:*ii' , j) ^ Cx,y,T.*{h j), and therefore all such 'bad' pairs xy 
satisfy \Cx^y;T,*{'i, j)\ > p~^n~^^^'^. By the findings of Section [8.2.21 it thus suffices to consider 
Cx,y,T:*{h j) for xy £ LY,*{i,j) with j £ {1,2}, since for all others (i85]l holds. Now, using (fSTl) and 
(f88|) . it is not difficult to see that the corresponding paths w^_2 = wi ■ ■ ■ = ves satisfy wi £ B, 
W2 £ IbIJM^^-^\A) and We £ N^^ ^\A, R). Putting things together, the extension property Z^t (cf. 
Lemma HTT]) implies (f9T]) . since every (vq, . . . ,ve-2) G Js*^£_3(i) \ .^s*/-3(0 is uniquely determined 
by the pair fe-2 = ve-zve-2- 

Let Qj:*j{i) and (3e*,m(^) contain all pairs {wi,we) £ Qs'ii) where at least one corresponding path 
wi---we satisfies W2 £ Ib and W2 £ M^^~'^\A) \ Ib, respectively. Now, using (p2]) and ([9T]) . to 
establish ([29]) . it suffices to prove, say, 

max{|QE*,7(i)|, |QE.,Af(OI} < (92) 

Using Lemma |8.2|, < n'^^'^ and that the degree is at most npn'^, we obtain, with room to spare, 

\Qs',l{i)\ < npTf ■ \Ib\ ■ {npf-^n^'''' < M^-^n^i^^+^'^+i)^ < {npY-^n~^''. 

Turning to QT,*,M{i), note that for every W2 £ M^^~'^\A) \ Ib we have |r(i(;2) r) B\ < npn~'^'^ by 
(I83p . With a similar argument as above, using Lemma El i.e., \M^'^-'^\A)\ < 

j^'iire^ we see that 

\Qs*,M(.i)\ < npn-^' ■ |M(^-2)(^)| • {npn^f-^ < (npf-^n^'^^^"-^-^'^^ < {npf-^n-^^', 

where the last inequality follows from (j78p . i.e., "d = 20^r. This establishes (|92p . which, as explained, 
completes the proof of Lemma 14.31 □ 
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